Abstract


Objective: 1,4-Dioxane (dioxane) and N-nitrosodimethylamine (NDMA) are emerging
groundwater  contaminants  that  are  probable  human  carcinogens.  Neither  compound  is 
significantly  attenuated  in  the  environment  by  volatilization  or  sorption  processes,  but  a  role  for 
aerobic  microbial  processes  in  their  removal  has  been  observed.  The  objective  of  this  SERDP 
project  was  to  identify  organisms,  enzymes  and  biochemical  pathways  involved  in  the  aerobic 
biodegradation  of  dioxane  and  NDMA,  in  order  to  develop  a  better  understanding  of  the  effects 
of  bacterial  degradation  on  the  fate  and  persistence  of  dioxane  and  NDMA  in  the  environment. 
This  project  focused  specifically  on  oxygenase-catalyzed  biodegradation  of  the  targeted 
compounds. 

Technical  Approach:  Bacterial  isolates  were  tested  for  degradation  of  NDMA  or  dioxane  under 
monooxygenase-inducing conditions. The potential role of monooxygenases in contaminant
removal was confirmed by exposing cells to the monooxygenase inhibitor acetylene and
monitoring for loss of activity. The role of monooxygenases was also verified with molecular
techniques by knocking out or heterologously expressing putative monooxygenase genes. The
effect of the common co-contaminants 1,1,1-trichloroethane and 1,1-dichloroethene on dioxane
degradation was tested, as was the effect of propane on NDMA degradation. Analytical
chemistry techniques were used to identify the suite of transformation intermediates of dioxane
metabolism. The genome of the dioxane-metabolizing actinobacterium Pseudonocardia
dioxanivorans strain CB1190 was sequenced and a gene expression microarray was applied to
identify genes linked to dioxane and tetrahydrofuran (THF) metabolism. Amino acid isotopomer
analysis with 13C-labeled carbon substrates was used to demonstrate the activity of strain
CB1190 metabolic pathways during dioxane and C2 substrate metabolism. Transformation of
strain CB1190 with plasmids and transposomes was tested in order to develop a set of genetic
tools for verifying the importance of particular genes in strain CB1190 metabolism.

Results: NDMA degradation in tested actinobacteria was linked to propane degradation, and in
Rhodococcus jostii strain RHA1 directly to a propane monooxygenase. In two tested isolates,
propane negatively impacted NDMA degradation. Monooxygenases were also linked to dioxane
degradation in a number of tested bacterial isolates, and dioxane-transforming activity was found
to be relatively common among monooxygenase-expressing bacteria. Both 1,1,1-trichloroethane
and 1,1-dichloroethene inhibited dioxane degradation by bacterial isolates. The first complete
pathway for the mineralization of dioxane was proposed based on identification of
transformation intermediates. The complete sequence of the strain CB1190 genome was obtained
and both chromosomal and plasmid-encoded genes were induced by dioxane and THF. Strain
CB1190 was shown to use the glyoxylate carboligase pathway during dioxane metabolism,
which identified how energy is obtained from dioxane degradation. The plasmid-encoded
monooxygenase was demonstrated to transform both dioxane and THF. Strain CB1190’s large
genome encodes the utilization of a wide variety of carbon and nitrogen compounds, and
annotation of some carbon utilization pathways was verified. The transformation of strain
CB1190 was achieved, which will enable further development of a genetic system for this
dioxane-metabolizing bacterium.

Benefits:  This  work  highlights  the  importance  of  monooxygenases  in  the  degradation  of  both 
NDMA  and  dioxane.  The  induction  of  monooxygenases  in  (co)metabolizing  bacteria  will  lead  to 
improved  methods  for  the  removal  of  NDMA  and  dioxane  from  contaminated  groundwater. 

1. Introduction to SERDP ER1417


Dioxane and NDMA - Statement of Need and Objectives

This research examined the biodegradation of the emerging water contaminants
1,4-dioxane (dioxane) and N-nitrosodimethylamine (NDMA). Both dioxane and NDMA are
probable human carcinogens and confirmed animal carcinogens. Neither is significantly
attenuated in the environment by volatilization or sorption processes. Due to its widespread use
as a solvent stabilizer, dioxane is frequently found commingled with chlorinated solvents at U.S.
Department of Defense and Department of Energy sites. NDMA is found as a degradation
byproduct in proximity to aerospace facilities that used hydrazine-based rocket fuel.

Although  the  carcinogenic  threats  of  dioxane  and  NDMA  have  been  understood  for  many 
years,  they  have  not  historically  been  considered  important  water  quality  issues,  mostly  due  to 
lack  of  awareness  about  their  potential  occurrence  in  drinking  water  supplies.  However,  with 
recent  advances  in  analytical  methods  and  growing  public  awareness  of  their  occurrence  in 
drinking  water  supplies,  dioxane  and  NDMA  are  emerging  as  important  water  contaminants. 
Consequently,  a  better  understanding  of  the  effects  of  bacterial  degradation  on  the  fate  and 
persistence  of  dioxane  and  NDMA  in  the  environment  is  needed. 

This  research  responded  to  SON  number  CUSON-05-01,  with  the  overall  goal  of 
identifying  organisms,  enzymes  and  biochemical  pathways  involved  in  the  aerobic 
biodegradation  of  dioxane  and  NDMA.  The  work  was  then  extended  to  examine  the  molecular 
basis for dioxane degradation by the actinomycete Pseudonocardia dioxanivorans strain
CB1190. This work aimed to provide a mechanistic understanding of the degradation of dioxane
and  NDMA  by  aerobic  microorganisms  and  as  such  to  provide  a  foundation  for  the 
bioremediation  of  these  contaminants  in  natural  and  engineered  systems. 

The  specific  objectives  of  the  original  piece  of  this  project  were: 

1.  Test  oxygenase-expressing  bacteria  for  NDMA  and  dioxane  degradation 

2.  Evaluate  the  mechanisms  and  pathways  of  NDMA  and  dioxane  degradation 

3.  Quantify  the  kinetics  of  NDMA  and  dioxane  oxidation  reactions  at  both  moderate  and  trace 
concentrations 

4.  Investigate  the  impacts  of  common  co-contaminants  and  inducing  substrates  on  NDMA  and 
dioxane  degradation  kinetics 

Specific  objectives  of  the  extension  were: 

5.  Sequence  and  analyze  the  genome  of  the  dioxane  metabolic  degrader  P.  dioxanivorans  strain 
CB1190. 

6. Develop a whole genome expression microarray for strain CB1190 and apply it to identify
genes important in metabolism and response to environmental stresses

7. Analyze metabolomics and fluxomics for strain CB1190 metabolism

8.  Develop  a  genetic  system  for  strain  CB1190 

These  objectives  were  used  to  guide  experimental  activities,  which  are  described  and 
summarized  in  sections  2  through  8  of  this  document,  while  sections  9  and  10  describe  the 
implications  of  the  research  and  the  cited  literature,  respectively. 

Background

Environmental  Relevance  of  dioxane  and  NDMA 

Dioxane  is  classified  as  a  probable  human  carcinogen  and  a  confirmed  animal  carcinogen 
(56),  and  exposure  to  dioxane  causes  damage  to  the  liver,  kidneys  and  nervous  system  (30). 
Dioxane  is  a  cyclic  organic  ether  that  is  widely  used  as  a  stabilizer  for  chlorinated  solvents  such 
as  1,1,1-trichloroethane  (TCA).  Dioxane  is  also  used  as  a  solvent  in  paper  and  textile 
processing,  and  in  the  manufacture  of  several  organic  chemicals  (181).  As  a  result  of  its 
widespread use and improper disposal, dioxane has been detected in surface waters and
groundwaters across the United States, usually as a co-contaminant of TCA. In an aquifer in the
San Gabriel Basin, an eight-mile plume was found that was believed to have several dozen
sources (98). Dioxane has also been identified as a frequent contaminant in the lower
Mississippi River (59). In Washtenaw County, Michigan, groundwater aquifers at a medical
filters manufacturing site are heavily contaminated with dioxane at concentrations up to 30,000
µg/L (166). At present, there is no federal maximum contaminant level for dioxane. Several
states have adopted water quality standards and/or guidelines for dioxane levels ranging from 3
to 85 µg/L (32, 98). Sorption onto activated carbon and air-stripping techniques have proven to
be ineffective in removing dioxane from water because of its high solubility and low vapor
pressure (Henry’s law constant of 5 × 10⁻⁶ atm·m³·mol⁻¹ at 20 °C). Dioxane does not sorb readily
to soils (log Kow = -0.27). Consequently, dioxane groundwater plumes generally expand faster
than associated chlorinated solvent plumes in aquifers (58). Photocatalytic oxidation is possible,
but is often expensive and requires ex situ remediation with pump-and-treat operations (54,
144). Aitchison et al. (1) have reported phytoremediation of dioxane by hybrid poplar trees. In
their laboratory studies, there was rapid uptake of dioxane from aqueous solution as well as from
soil, and most of the dioxane was transpired from leaf surfaces instead of being transformed.

NDMA  is  also  classified  as  a  probable  human  and  confirmed  animal  carcinogen  (7).  The 
US  EPA  has  established  a  cleanup  level  of  0.7  ng/L  for  NDMA  in  groundwater  (160),  based  on  a 
risk assessment target of an increased lifetime cancer risk of 10⁻⁶ in drinking water. The
environmental occurrence of this compound is regulated to a concentration about one
one-thousandth that of dioxane, with a California Department of Health action level of 10 ng/L (22).
NDMA is highly water-soluble, and though volatilization can play a role in surface waters, given
its low Henry's constant (2.6 × 10⁻⁷ atm·m³·mol⁻¹ at 20 °C), volatilization should be an
insignificant  process  in  groundwater  (97,  107).  NDMA  does  not  significantly  sorb  to  activated 
carbon,  and  it  is  highly  mobile  in  soil  regardless  of  the  organic  carbon,  humic  acid,  pH,  clay,  or 
cation  exchange  capacity  of  the  soil  (29,  44,  63).  NDMA’s  presence  in  the  environment  has 
been  linked  to  aerospace  facilities  through  the  decomposition  of  hydrazine-based  rocket  fuels 
(21)  and  to  the  discharge  of  chlorine-based  disinfected  water  and  wastewater  (21,  100,  109).  Its 
persistence  in  groundwater  aquifers  has  been  responsible  for  the  closure  of  at  least  two  municipal 
drinking  water  wells  in  Orange  County,  California  (109).  NDMA  groundwater  concentrations  as 
high  as  400,000  ng/L  on-site  and  20,000  ng/L  off-site  have  been  detected  at  a  rocket  engine 
testing  facility  in  Sacramento  County,  California,  that  used  unsymmetrical  dimethylhydrazine 
(UDMH)-based  rocket  fuel.  These  extreme  concentrations  prompted  closure  of  downgradient 
drinking  water  wells  at  this  site  (21).  NDMA  has  also  been  detected  in  groundwater 
downgradient  of  a  rocket  engine  testing  facility  in  the  San  Gabriel  Valley,  California,  at 
concentrations  up  to  3,000  ng/L  (21)  and  groundwater  containing  350  ng/L  NDMA  has  been 
documented  at  the  Rocky  Mountain  Arsenal  in  Commerce  City,  Colorado. 
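
The Henry's constants quoted in this section for dioxane and NDMA can be put on a common footing by converting them to the dimensionless air/water partition ratio H' = H/(RT). The following is a minimal sketch of that conversion; the function and variable names are illustrative assumptions rather than anything used in the original study, and ideal-gas behavior is assumed.

```python
# Convert Henry's constants (atm·m^3·mol^-1) to dimensionless air/water ratios.
R_ATM_M3 = 8.205736e-5  # molar gas constant in atm·m^3·mol^-1·K^-1
T_KELVIN = 293.15       # 20 °C, the temperature quoted in the text


def dimensionless_henry(h_atm_m3_per_mol, temperature_k=T_KELVIN):
    """Return H' = H / (R * T), the gas-phase to water-phase concentration ratio."""
    return h_atm_m3_per_mol / (R_ATM_M3 * temperature_k)


# Values as quoted in the text (atm·m^3·mol^-1 at 20 °C).
HENRY_CONSTANTS = {"dioxane": 5e-6, "NDMA": 2.6e-7}

for name, h in HENRY_CONSTANTS.items():
    print(f"{name}: H' = {dimensionless_henry(h):.1e}")
```

Both ratios come out far below 1, which is consistent with the statement that volatilization and air stripping remove little of either compound from water.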

Existing knowledge of dioxane and NDMA biotransformation prior to start of project

Bioremediation  may  be  a  cost-effective  alternative  for  the  treatment  of  dioxane  and 
NDMA;  however,  prior  to  the  start  of  this  project,  little  was  known  about  the  microbial 
biodegradation  of  these  compounds. 

There  was  some  evidence  for  biodegradation  of  dioxane  by  aerobic  microorganisms. 
The bacterial strains Mycobacterium vaccae and Rhodococcus sp., as well as the fungus
Aureobasidium pullulans, were shown to degrade dioxane cometabolically, but dioxane
biodegradation did not support growth of these organisms (13, 19, 115). Aerobic biodegradation
of high concentrations of dioxane by mixed cultures in industrial sludge has also been reported
(125, 127). Parales et al. (112) showed that a nocardioform culture, CB1190, was capable of
growing on dioxane. One study showed dioxane degradation by cometabolism using
tetrahydrofuran as the primary substrate; however, tetrahydrofuran is itself a toxic chemical and
is thus unsuitable for enhancing in situ bioremediation (180).

The  literature  associated  with  the  microbial  degradation  of  NDMA  was  even  more  sparse. 
In  the  non-microbial  world,  the  microsomal  fractions  of  tulip  bulbs  (145),  spinach,  and  lettuce 
(29)  had  all  been  shown  to  metabolize  NDMA.  In  addition,  many  higher  organisms  including 
mammals  possess  enzymes  that  have  the  ability  to  degrade  NDMA  (74,  157).  In  the  microbial 
world, undefined consortia grown aerobically had demonstrated the ability to degrade
NDMA in laboratory incubations (44, 63, 151). Biological activity was implicated in the
disappearance  of  NDMA  at  a  field  site  in  Colorado  (44).  Kaplan  and  Kaplan  (63)  observed  that 
an  activated  carbon  trickling  filter  acted  as  a  growth  matrix  for  NDMA  degrading  organisms 
while  a  sterilized  column  did  not  remove  the  NDMA.  However,  attempts  to  isolate  bacterial 
strains  from  these  NDMA-active  columns  failed.  In  fact,  there  had  been  no  reports  in  the 
literature  of  pure  cultures  capable  of  growth  on  NDMA  or  even  cometabolic  degradation  of  this 
compound. 

Formation  of  the  hypothesis:  microbial  monooxygenases  and  mammalian  P-450  enzymes 

As  described  above,  previous  studies  provided  evidence  that  dioxane  can  be  biodegraded 
aerobically  by  both  metabolic  and  cometabolic  reactions;  however,  these  studies  examined  only 
a  small  pool  of  potential  microorganisms  and  the  responsible  enzymes  and  pathways  had  not 
been  elucidated.  Similarly,  biological  activity  had  been  implicated  in  the  disappearance  of 
NDMA  in  aerobic  microcosms,  but  adequate  attempts  had  not  been  made  to  identify  the 
responsible  organisms,  determine  degradation  pathways,  characterize  the  responsible  enzymes, 
or  quantify  the  reaction  kinetics.  Indeed,  attempts  to  isolate  bacterial  strains  capable  of  growth 
on  NDMA  had  failed. 

Though  the  literature  involving  microbial  metabolism  of  dioxane  and  NDMA  was  sparse, 
the  metabolism  in  mammalian  systems  of  both  compounds  had  been  extensively  studied  due  to 
concerns  about  their  carcinogenicity  (75,  157,  170,  177).  Evidence  from  animal  studies 
indicated  that  biotransformation  of  these  compounds  likely  involves  NADH-dependent  oxidation 
by  cytochrome  P-450  monooxygenase  enzymes.  Monooxygenases  are  a  generic  class  of 
enzymes  that  catalyze  the  addition  of  one  atom  of  molecular  oxygen  directly  into  an  organic 
substrate.  The  second  oxygen  atom  is  reduced  to  water  either  by  the  substrate  itself  or  by  a 
cosubstrate reductant such as NADH. All monooxygenases contain a cofactor (a transition
metal, flavin, or pteridine) that overcomes the spin-forbidden reaction between the
paramagnetic dioxygen and the singlet carbon in organic compounds. A monooxygenation
proceeds in two steps. First, two reducing equivalents generated by the oxidation of
NAD(P)H are transferred to the cofactor, which is thereby reduced. In the second step, dioxygen
couples with the reduced cofactor to form activated oxygen radicals, which subsequently
hydroxylate the substrate (50).
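
The two-step mechanism described above can be summarized as a single balanced overall reaction. The notation below is a sketch for a generic substrate, written here as R-H (a symbol introduced only for illustration), and it assumes NAD(P)H acts as the cosubstrate reductant:

```latex
\[
\mathrm{R{-}H} + \mathrm{O_2} + \mathrm{NAD(P)H} + \mathrm{H^+}
\;\longrightarrow\;
\mathrm{R{-}OH} + \mathrm{H_2O} + \mathrm{NAD(P)^+}
\]
```

One oxygen atom of dioxygen ends up in the hydroxylated product and the other in water, which is the defining feature of a monooxygenation.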

A variety of cytochrome P-450s have been identified from prokaryotes, yeast, fungi,
plants,  insects,  and  mammals  (103).  In  prokaryotes,  monooxygenases  play  significant  roles  in 
catabolic  pathways.  Mammalian  monooxygenases  are  primarily  involved  in  the  hydroxylation  of 
steroids,  in  the  biosynthesis  of  vitamin  D  and  neurotransmitters,  and  in  the  detoxification  of 
poisonous compounds (50). Eukaryotic cytochrome P-450s located in the endoplasmic
reticulum and mitochondria are membrane-bound, while most bacterial P-450s are soluble. An
evolutionary  tree  representing  the  divergent  evolution  of  prokaryotic  and  eukaryotic  P-450 
monooxygenases  based  upon  a  comparison  of  the  primary  structures  of  154  members  of  the 
cytochrome  P-450  superfamily  has  been  described  (103)  and  mechanisms  such  as  genetic 
transfer  mediated  by  plasmids,  chromosomal  gene  mobilization,  transduction,  transformation, 
and  transposition  have  been  proposed  to  explain  the  evolution  of  cytochrome  P-450 
monooxygenases  among  prokaryotic  and  eukaryotic  organisms.  Thus  monooxygenase  enzymes, 
though  found  in  very  different  types  of  organisms,  evolved  from  a  common  ancestor  and  for  this 
reason  share  common  traits. 

It  is  our  hypothesis  that  since  bacterial  monooxygenase  enzymes  are  similar  in  structure, 
function  and  reaction  mechanisms  to  mammalian  P-450  enzymes,  a  class  of  these  bacterial 
monooxygenases  will  have  the  ability  to  degrade  dioxane  and  NDMA  analogously  to  their 
mammalian  counterparts. 

2. Enzymes and kinetics of bacterial NDMA degradation


This  is  a  summary  of  work  published  in: 

1. Sharp, J. O., C. M. Sales, J. C. LeBlanc, J. Liu, T. K. Wood, L. D. Eltis, W. W. Mohn,
and L. Alvarez-Cohen. 2007. An inducible propane monooxygenase is responsible for
N-Nitrosodimethylamine degradation by Rhodococcus sp. strain RHA1. Appl. Environ. Microbiol.
73:6930-6938.

2. Sharp, J. O., C. M. Sales, and L. Alvarez-Cohen. 2010. Functional characterization of
propane-enhanced N-nitrosodimethylamine degradation by two actinomycetales. Biotechnol.
Bioeng. 107:924-932.

Introduction 

Microorganisms  grown  on  substrates  such  as  propane,  methane,  and  toluene  have  been 
shown  to  rapidly  oxidize  NDMA  in  the  laboratory  (39,  138).  In  these  cases,  evidence  from 
inhibition  and  induction  experiments  along  with  observations  of  requisite  oxygen  consumption 
suggests  that  propane  monooxygenases  (PrMO),  soluble  methane  monooxygenases  (sMMO), 
and toluene monooxygenases (TMO) are most likely involved in these transformations. In
addition,  experiments  with  Escherichia  coli  clones  expressing  TMO  inserts  confirmed  the  role  of 
toluene  4-monooxygenase  (T4MO)  in  NDMA  oxidation,  while  cupric  selection  for  soluble  rather 
than  particulate  MMO  confirmed  the  role  of  sMMO  (138).  The  involvement  of  PrMO  is  less 
understood,  as  the  traditional  boundary  can  blur  between  enzymes  oxidizing  gaseous  and  liquid 
n-alkanes.  Liquid  alkanes  are  typically  oxidized  by  alkane  monooxygenases  (AlkMO),  but 
AlkMO  can  be  induced  by  propane  in  some  bacteria  but  not  in  others  (47,  86).  Regardless  of  the 
class of monooxygenase involved, NDMA is transformed with little observed benefit to the cells
and no evidence of cellular growth, despite the production of oxidized products, including
formaldehyde, methylamine, and methanol, that can be incorporated into primary metabolic
pathways (39, 136, 176). The limited evidence for metabolism suggests that non-energy-generating
transformations, such as cometabolic oxidation reactions, play an important role in the biological
attenuation  of  NDMA. 

Rhodococci are soil heterotrophs of the order Actinomycetales with a noted diversity of
functional enzymatic activities (45, 80). Collectively, this order is biologically and economically
significant for the production of a diverse array of enzymes involved in the production of
commercial secondary metabolites, antibiotics, and the metabolism of xenobiotic compounds for
environmental and industrial applications (104). Global genomic, transcriptomic, and functional
analyses of Rhodococcus jostii strain RHA1 reveal tremendous enzymatic diversity with the
potential to grow on a wide variety of aromatic compounds, carbohydrates, nitriles, and steroids
as the sole carbon and energy sources (see reference (96) and references therein). Indeed, the
genome of strain RHA1 is predicted to encode over 200 oxygenases, including both PrMO and
AlkMO gene clusters. The genetic blueprint provided by the annotated genome and the
development of a corresponding global microarray facilitate the identification of genes
responsible for physiological traits of RHA1, especially for growth and enzymatic activity. For
this reason, strain RHA1 was selected as a model organism to better understand the genetics and
biochemistry of NDMA transformation.




The  goal  of  this  work  was  to  determine  the  role  of  PrMO  activity  in  the  transformation  of 
NDMA  by  strain  RHA1.  Similar  analyses  were  then  extended  to  other  actinobacteria, 
Rhodococcus  sp.  strain  RR1  and  Mycobacterium  vaccae  strain  JOB-5. 

Results  and  Discussion 

We  first  tested  the  ability  of  strain  RHA1  to  degrade  NDMA.  Cells  grown  in  rich 
medium (LB or soy broth) or in pyruvate-amended minimal medium degraded NDMA at a rate
of 0.04 ± 0.01 µg NDMA mg protein⁻¹ h⁻¹. In contrast, propane-grown RHA1 cells removed NDMA
at rates approximately 500-fold higher. The kinetics of NDMA removal are represented in Fig.
2.1. Monod kinetic parameters for NDMA removal by propane-grown RHA1 were determined:
the maximum NDMA removal rate (Vmax) was 18 ± 3 µg NDMA mg protein⁻¹ h⁻¹ and the
half-saturation constant (Ks) was 20 ± 17 µg NDMA L⁻¹.
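
As a rough illustration of what the fitted parameters imply, the Monod expression v = Vmax·S/(Ks + S) can be evaluated with the values reported above. This is only a sketch: the function name is an assumption, concentrations are expressed in the units of Ks and rates in the units of Vmax, and the large reported uncertainty on Ks (20 ± 17) is ignored.

```python
V_MAX = 18.0  # reported maximum NDMA removal rate for propane-grown RHA1
K_S = 20.0    # reported half-saturation constant (NDMA mass per liter)


def monod_rate(s, v_max=V_MAX, k_s=K_S):
    """Monod removal rate at NDMA concentration s (same units as k_s)."""
    if s < 0:
        raise ValueError("concentration must be non-negative")
    return v_max * s / (k_s + s)


# Evaluate the model below, at, and above the half-saturation constant.
for s in (2.0, 20.0, 200.0):
    rate = monod_rate(s)
    print(f"S = {s:6.1f}: v = {rate:5.2f} ({100 * rate / V_MAX:.0f}% of Vmax)")
```

At S = Ks the rate is half of Vmax by construction, and at ten times Ks the model gives about 91% of Vmax, so removal at that concentration is close to saturated.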


Figure 2.1. Constitutive removal of NDMA occurs at a fraction of the propane-induced rate. A
Monod kinetic model (line) is fit to NDMA removal rates measured after RHA1 was grown on propane
(diamonds). The constitutive NDMA degradation rate at 200 µg/L (square) represents average removal
after independent growth on three noninducing substrates (pyruvate, LB medium, or soy broth).

In  order  to  understand  the  role  of  propane  in  the  induction  of  NDMA  removal  by  strain 
RHA1,  we  investigated  the  effect  of  propane  on  gene  expression  using  a  custom  strain  RHA1 
spotted  microarray.  Relative  to  growth  on  pyruvate,  45  genes  were  up-regulated  in  response  to 
propane, with the greatest fold increase (125-fold) observed for prmA (ro00441), encoding the large
hydroxylase subunit of PrMO. The prmA gene is part of a 13-gene cluster of which seven
additional genes had expression ratios greater than 10. In contrast, a gene cluster encoding alkane
monooxygenase (AlkMO), which has been implicated in propane transformation in some
bacteria,  was  not  significantly  differentially  expressed  in  propane-grown  RHA1.  These 
microarray  results  were  then  validated  with  RT-qPCR  (Fig.  2.2). 




Figure 2.2. Effect of propane on transcription of three aliphatic monooxygenase components as
quantified by RT-qPCR (white bars) and spotted microarray (grey bars). The microarray did not
include a probe for prmB (NP), preventing its quantification by that method. Asterisks denote that the
propane-grown values are statistically different from those of the pyruvate-grown controls based on Student’s t test.


To confirm that PrMO is the primary catalyst for NDMA oxidation in RHA1,
knockout mutant strains were generated with deletions in prmA and alkB (encoding a subunit of
AlkMO). Knockout of either gene did not affect growth of RHA1 on rich LB medium.
However, while wild-type RHA1 and the alkB mutant grew on propane-amended minimal
medium, the prmA mutant did not grow on propane. When the wild-type and mutant strains of
RHA1 were grown on LB medium and washed cells were exposed to NDMA, the wild-type and
alkB mutant strains removed NDMA to below detection limits in 4 h, whereas the prmA mutant
did not significantly remove any NDMA over 19 h.

This  work  demonstrated  that  the  PrMO  gene  cluster  in  Rhodococcus  jostii  strain  RHA1 
encoded both for growth on propane and removal of NDMA. The low-level NDMA removal
observed when RHA1 is grown on LB or soy broth rich medium indicates that PrMO is
constitutively expressed and is upregulated when RHA1 is exposed to propane.

We  next  extended  our  work  examining  the  correlation  of  propane  and  NDMA 
degradation  to  two  other  actinobacteria,  Rhodococcus  sp.  RR1  and  Mycobacterium  vaccae  strain 
JOB-5. Each strain was first grown on propane and then the kinetics of NDMA removal were
determined by exposing washed cells to varying concentrations of NDMA. The maximum
NDMA removal rates (Vmax) were relatively similar (44 and 28 µg NDMA mg protein⁻¹ h⁻¹ for
strains RR1 and JOB-5, respectively), whereas the half-saturation constants (Ks) differed by two
orders of magnitude (36 and 2,200 µg NDMA L⁻¹ for strains RR1 and JOB-5, respectively).
When grown on soy broth rich medium, strain RR1 had low constitutive NDMA removal
activity (like strain RHA1 shown above), whereas strain JOB-5 lacked such constitutive activity.
Cells of both RR1 and JOB-5 pre-grown with propane were able to remove low concentrations
of NDMA, degrading 100 ng NDMA L⁻¹ to below the limit of detection (20 ng/L).
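
Because the two strains have similar Vmax values but very different Ks values, their behavior at trace NDMA concentrations (S much smaller than Ks) is more usefully compared through the ratio Vmax/Ks, the limiting pseudo-first-order slope of the Monod curve. The sketch below uses only the values reported above; the dictionary layout and function name are illustrative assumptions, and Vmax and Ks are assumed to share the same NDMA mass unit so that it cancels.

```python
# (Vmax, Ks) as reported in the text for propane-grown cells.
STRAINS = {"RR1": (44.0, 36.0), "JOB-5": (28.0, 2200.0)}


def first_order_slope(v_max, k_s):
    """Limiting slope of the Monod curve for S << Ks (rate is about slope * S)."""
    return v_max / k_s


slopes = {name: first_order_slope(*params) for name, params in STRAINS.items()}

for name, slope in slopes.items():
    # Units: liters per mg protein per hour, since the NDMA mass unit cancels.
    print(f"{name}: Vmax/Ks = {slope:.4f} L per mg protein per h")

print(f"RR1 / JOB-5 slope ratio: {slopes['RR1'] / slopes['JOB-5']:.0f}")
```

Under this approximation, RR1 would remove trace NDMA roughly two orders of magnitude faster per unit biomass than JOB-5, even though its Vmax is less than twice as large.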

The  effects  of  propane  on  NDMA  degradation  rates  were  then  determined.  For  both 
strains RR1 and JOB-5, propane inhibited NDMA degradation in proportion to its concentration, but
analysis of the inhibition kinetics revealed complex behavior that does not correspond to classical
enzymatic models. By using an alternative approach (142), we were able to compare the
inhibitory effects of propane on NDMA removal by the two strains, and we found that JOB-5
was significantly more sensitive than RR1 to the presence of propane (Fig. 2.3). For example,
when amended with 200 µg NDMA L⁻¹, RR1 required 7,700 µg propane L⁻¹ to decrease the
NDMA removal rate by 50%, whereas JOB-5 required only 120 µg propane L⁻¹ to achieve the
same effect.

Figure 2.3. Inhibitory effect of propane on NDMA degradation. Initial relative rates for the
degradation of 200 µg/L NDMA were normalized as the degradation rate (vn,p) in the presence of a given
propane concentration divided by the rate (vn) in the absence of propane. The dissolved propane
concentration was adjusted using the Henry’s constant to account for liquid phase partitioning.
Squares  =  relative  rate  of  NDMA  degradation  for  strain  Rhodococcus  sp.  RR1;  circles  =  relative  rate  of 
NDMA  degradation  for  strain  Mycobacterium  vaccae  JOB-5. 


To gain insight into the presence of propane and alkane monooxygenases in strains RR1
and JOB-5, we first developed a prmA degenerate primer set based on annotated prmA sequences
in the NCBI Entrez Protein database and attempted to amplify putative prmA from DNA
extracted from these two strains. A PCR product of the expected size (~1300 bp) was obtained
only for strain RR1, and the prmA fragment was found to be closely related to the strain RHA1
prmA sequence (91% nucleotide identity and 96% amino acid identity). We then applied alkB
degenerate primers to DNA from each strain and obtained the expected 343 bp fragment from
both strains. Sequence analyses revealed that RR1 possessed one alkB homologue while JOB-5
contained at least two distinct copies, which share 60% nucleotide identity.
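
Percent identity values such as those quoted above are computed from a pairwise alignment. A minimal sketch of that calculation follows, assuming the two sequences are already aligned with "-" marking gaps and that gapped columns are excluded from the denominator (conventions differ between tools). The sequences in the example are toy strings, not data from this study.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of ungapped alignment columns in which the two residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned nucleotide fragments (hypothetical, for illustration only).
toy_a = "ATGGCCTTAA"
toy_b = "ATGGCATTAA"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Amino acid identity can exceed nucleotide identity for the same gene pair, as reported here for prmA, because some nucleotide substitutions do not change the encoded amino acid.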

We  examined  the  effect  of  the  presence  of  propane  on  the  expression  of  genes  identified 
above for strains RR1 and JOB-5. In RR1, prmA was induced 70-fold relative to controls not
exposed to propane, whereas alkB transcription levels did not change significantly. Similarly, the
expression of both alkB homologues in JOB-5 was not significantly changed by the presence
of propane (Fig. 2.4).

Figure 2.4. Difference in expression for candidate propane-oxidizing genes when contrasting
propane- and pyruvate-grown cultures. The dashed line represents a fivefold differential expression
threshold. Expression data for strain RHA1 are from Sharp et al. (137).


Conclusion 


This  work  established  a  role  for  PrMO  in  NDMA  degradation  by  some  actinobacteria. 
The Rhodococcus and Mycobacterium strains tested in this study are common soil bacteria of the
order Actinomycetales that have previously been shown to metabolize propane. The orders of
magnitude differences in transformation rates between constitutive and promoted conditions
reported  here  and  in  prior  studies  (39,  137)  suggest  that  the  addition  of  propane  as  an  inducing 
substrate  could  substantially  enhance  the  bioremediation  of  NDMA  in  contaminated  soils. 

3. Bacterial strain evaluation, degradation kinetics, pathway determination
and  co-contaminant  inhibitory  effects  for  dioxane  degradation 


This  is  a  summary  of  work  published  in: 

1. Mahendra, S., and L. Alvarez-Cohen. 2006. Kinetics of 1,4-dioxane biodegradation by
monooxygenase-expressing bacteria. Environ. Sci. Technol. 40:5435-5442.

2. Mahendra, S., C. J. Petzold, E. E. Baidoo, J. D. Keasling, and L. Alvarez-Cohen. 2007.
Identification of the intermediates of in vivo oxidation of 1,4-dioxane by
monooxygenase-containing bacteria. Environ. Sci. Technol. 41:7330-7336.

3. Mahendra, S., A. Grostern, and L. Alvarez-Cohen. 2012. The impact of common
co-contaminants and inducing substrates on degradation and inhibition kinetics of 1,4-dioxane.
Environ. Sci. Technol., submitted.


Introduction 

The  metabolism  of  dioxane  in  mammalian  systems  has  been  fairly  well  studied  while 
evaluating  the  nature  of  its  carcinogenicity.  Evidence  from  animal  studies  indicates  that  dioxane 
biotransformation involves NADH-dependent oxidation by cytochrome P-450 monooxygenase
enzymes (170, 177). Several ethers, such as tetrahydrofuran and methyl tert-butyl ether, have
also been shown to degrade via cytochrome P-450-catalyzed monooxygenation (142, 154). Since
bacterial monooxygenase enzymes are similar in structure, function, and reaction mechanisms to
mammalian  P-450  enzymes,  it  is  logical  that  these  enzymes  may  also  have  the  ability  to  degrade 
dioxane  in  reactions  analogous  to  their  mammalian  counterparts.  Indeed,  the  broad  specificity  of 
monooxygenases  induced  by  substrates  such  as  methane  (106),  propane  (165),  toluene  (94), 
butane  (46),  and  ethene  (25)  has  been  widely  reported. 

Bioremediation may be an attractive option for the treatment of dioxane, as both
bacteria and fungi have been shown to degrade dioxane via metabolic and co-metabolic
reactions. However, little research has been conducted to identify the enzymes
responsible for dioxane degradation or to quantify the kinetics of the degradation reactions.

In  addition,  a  complete  pathway  for  dioxane  degradation  has  not  yet  been  determined.  In 
a study of the pathway of bacterial dioxane degradation, Vainberg et al. (161) detected
2-hydroxyethoxyacetic acid (HEAA) as the terminal product of dioxane degradation by
Pseudonocardia  strain  ENV478.  They  suggested  that  the  degradation  was  carried  out  by  a 
monooxygenase  similar  to  putative  tetrahydrofuran  monooxygenase  of  Pseudonocardia 
tetrahydrofuranoxydans  K1  (154).  Neither  dioxane  mineralization  nor  its  assimilation  into 
biomass  was  observed.  Similarly,  HEAA  has  been  reported  as  the  major  metabolite  of  dioxane 
via  a  cytochrome  P450  monooxygenase  reaction  in  mammalian  cells  (169,  178).  An  incomplete 
dioxane degradation pathway has been reported for mammals and aerobic bacteria. In contrast,
HEAA was not detected during dioxane degradation by the fungus Cordyceps sinensis, although
it  was  suggested  that  etherases  or  oxidases  were  involved  in  the  degradation  (101).  The  only 
intermediates  of  dioxane  degradation  identified  in  that  study  were  ethylene  glycol,  glycolic  acid, 
and  oxalic  acid. 

Finally, dioxane was commonly used as a stabilizer for chlorinated solvents such
as 1,1,1-trichloroethane (TCA), and historical practices of production, storage, and disposal of
TCA containing dioxane have caused widespread soil and groundwater pollution by solvents and
their stabilizers. As a result, TCA and its abiotic breakdown product 1,1-dichloroethene (DCE)
are common co-contaminants with dioxane in groundwater aquifers. To date, the effect of
chlorinated solvents on dioxane biodegradation has not been evaluated.

The  goals  of  this  work  were  to:  evaluate  the  capabilities  of  monooxygenase-expressing 
bacteria  for  degrading  dioxane,  measure  oxidation  kinetics  for  dioxane  degradation,  determine 
the  pathway  for  the  complete  mineralization  of  dioxane,  and  examine  the  potential  inhibitory 
effects of co-contaminating chlorinated solvents on dioxane degradation.

Results  and  Discussion 


To  determine  the  correlation  between  monooxygenase  expression  and  the  ability  to 
degrade  dioxane,  we  tested  a  number  of  known  monooxygenase-expressing  bacterial  isolates  for 
their  activity  towards  dioxane  (Table  3.1).  Two  strains,  Pseudonocardia  dioxanivorans  strain 
CB1190  and  Pseudonocardia  benzenivorans  strain  B5,  were  capable  of  growing  with  dioxane  as 
the  sole  carbon  and  energy  source.  For  both  cultures  the  presence  of  acetylene  (which  inhibits 
some  monooxygenase  enzymes)  inhibited  dioxane  degradation,  and  activity  was  not  immediately 
recovered  following  acetylene  removal.  Additionally,  dioxane  degradation  did  not  occur  in  the 
absence of molecular oxygen. Together, these results suggest that the enzymes catalyzing the initial
oxidation of dioxane in these two strains are monooxygenases. The cell yields for strains
CB1190 and B5 were 0.09 ± 0.002 and 0.03 ± 0.002 mg protein (mg dioxane)^-1, respectively.

The growth kinetics of both strains were well described by the Monod equation, with the
maximum growth rate (k) of strain CB1190 exceeding that of strain B5 by a factor of 10 (Fig.
3.1).


Table 3.1. Bacterial strains tested for the ability to degrade dioxane.

Bacterial strain                      Growth substrate  Oxygenase expressed      Dioxane degradation
Pseudonocardia dioxanivorans CB1190   dioxane           unknown                  yes
Pseudonocardia benzenivorans B5       dioxane           unknown                  yes
Pseudonocardia K1                     THF               tetrahydrofuran MO       yes
Pseudonocardia K1                     toluene           tetrahydrofuran MO       yes
Pseudonocardia sulfidoxydans          THF               unknown                  no
Pseudonocardia hydrocarbonoxydans     glucose           unknown                  no
Methylosinus trichosporium OB3b       methane           soluble methane MO       yes
Methylosinus trichosporium OB3b       methane           particulate methane MO   no
Mycobacterium vaccae JOB5             propane           propane MO               yes
Rhodococcus RR1                       toluene           unknown                  yes
Methylibium petroleiphilum PM1        MTBE              unidentified MO          no
Methylibium petroleiphilum PM1        toluene           unidentified MO          no
Burkholderia cepacia G4               toluene           toluene-2-MO             yes
Ralstonia pickettii PKO1              toluene           toluene-p-MO             yes
Pseudomonas mendocina KR1             toluene           toluene-4-MO             yes
Pseudomonas putida mt-2               toluene           toluene-side chain-MO    no
Escherichia coli TG1(T2MO)            LB broth          toluene-2-MO             yes
Escherichia coli TG1(TpMO)            LB broth          toluene-p-MO             yes
Escherichia coli TG1(T4MO)            LB broth          toluene-4-MO             yes
Escherichia coli TG1(ToMO)            LB broth          toluene-o-xylene-MO      no
Pseudomonas JS150                     toluene           toluene-2,3-DO           no
Pseudomonas putida F1                 toluene           toluene-2,3-DO           no
Escherichia coli pCR 2.1-TOPO         LB broth          none                     no

Figure 3.1. Monod plot of dioxane degradation by (a) Pseudonocardia dioxanivorans CB1190 and
(b) Pseudonocardia benzenivorans B5 over a range of concentrations (x-axis: dioxane
concentration, mg/L). The parameters kmax and Km were calculated by nonlinear regression
minimizing the difference between measured and modeled rates at each concentration. The error
bars represent the range of calculated values from each replicate.
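
The nonlinear regression described in the caption can be illustrated with a minimal sketch. It assumes the Monod form rate = kmax * S / (Km + S) and fits kmax and Km by least squares. The fitting routine (a closed-form kmax for each trial Km, combined with a golden-section search over Km) is one simple choice and is not necessarily the procedure used in this work. The data are synthetic and hypothetical.

```python
# Minimal sketch: fit Monod parameters (kmax, Km) by nonlinear least squares.
# All data below are synthetic and hypothetical, not measurements from this study.

def monod_rate(s, kmax, km):
    """Monod rate expression: kmax * S / (Km + S)."""
    return kmax * s / (km + s)

def best_kmax(conc, rates, km):
    """For a fixed Km the model is linear in kmax, so kmax has a closed form."""
    f = [s / (km + s) for s in conc]
    return sum(r * fi for r, fi in zip(rates, f)) / sum(fi * fi for fi in f)

def sse(conc, rates, km):
    """Sum of squared residuals at the best kmax for this Km."""
    kmax = best_kmax(conc, rates, km)
    return sum((r - monod_rate(s, kmax, km)) ** 2 for s, r in zip(conc, rates))

def fit_monod(conc, rates, km_lo=1e-3, km_hi=1e4, iters=200):
    """Golden-section search over Km (assumes a single minimum in the bracket)."""
    g = (5 ** 0.5 - 1) / 2
    a, b = km_lo, km_hi
    c, d = b - g * (b - a), a + g * (b - a)
    for _ in range(iters):
        if sse(conc, rates, c) < sse(conc, rates, d):
            b, d = d, c
            c = b - g * (b - a)
        else:
            a, c = c, d
            d = a + g * (b - a)
    km = (a + b) / 2
    return best_kmax(conc, rates, km), km

# Hypothetical noise-free data generated from kmax = 1.0 and Km = 150 mg/L
conc = [10, 50, 100, 250, 500, 1000]
rates = [monod_rate(s, 1.0, 150.0) for s in conc]
kmax_fit, km_fit = fit_monod(conc, rates)
print(f"kmax = {kmax_fit:.3f}, Km = {km_fit:.1f} mg/L")
```

With real measurements, the same routine would be applied to the measured rate at each dioxane concentration, and the bracket for Km would need to enclose a plausible range for the culture.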

Co-metabolic  degradation  of  dioxane  was  detected  in  seven  bacterial  isolates  following 
growth  on  primary  substrates  known  to  induce  monooxygenases  (Table  3.1).  For  these  strains, 
inducing  substrates  leading  to  dioxane  transformation  included  methane,  propane, 
tetrahydrofuran  and  toluene.  As  with  the  metabolic  dioxane  degraders,  dioxane  degradation  by 
co-metabolizing  strains  was  inhibited  by  acetylene,  supporting  the  hypothesis  that  a 
monooxygenase is responsible for dioxane degradation. This role for monooxygenases was
further supported by the demonstration that recombinant E. coli heterologously expressing
monooxygenase genes from known dioxane co-metabolic degraders were able to degrade dioxane,
whereas E. coli with an empty expression vector lacked this activity.

For most of the cultures that degraded dioxane cometabolically, degradation ceased
before the dioxane was completely consumed, indicating that the resting cells have a limited
transformation capacity for dioxane. This limited transformation capacity may be caused by
limited reductant (NADH) to fuel the monooxygenase, toxicity due to dioxane, and/or toxicity
due to dioxane metabolites. We examined the reductant limitation issue with Methylosinus
trichosporium strain OB3b by supplying dioxane-degrading cell suspensions with formate,
which can be used to generate NADH. However, the addition of formate did not have any effect
on the rate or extent of dioxane transformation, indicating that reductant limitation did not lead to
the cessation of dioxane degradation. Given that OB3b tolerates dioxane concentrations up to
500 mg/L, dioxane toxicity is not likely to cause the observed phenomenon. After limited
dioxane degradation, OB3b cell suspensions failed to oxidize methane, suggesting that dioxane
transformation products exert irreversible toxicity to the monooxygenase enzyme and/or the cells
in general.

We next examined the pathway for dioxane degradation and metabolism using metabolic
and co-metabolic dioxane-degrading bacterial strains. Cell suspensions were incubated with
uniformly 14C-labeled dioxane and the distribution of the 14C label in cell biomass, CO2, volatile
and non-volatile fractions over time was determined. Dioxane-metabolizing strain CB1190 and
all four co-metabolic dioxane degraders tested generated 14CO2 and 14C-labeled volatile organic
acids, while only CB1190 accumulated 14C label in biomass (Fig. 3.2).


Figure 3.2. Final distribution of 14C derived from the degradation of dioxane after 48 h incubation.
Legend: non-volatiles; volatile organic acids; CO2; cell-associated.
Strain metabolizing dioxane: CB1190; strains co-metabolizing dioxane: Pseudonocardia
tetrahydrofuranoxydans K1, Mycobacterium vaccae JOB5, Pseudomonas mendocina KR1, E. coli
heterologously expressing toluene-4-monooxygenase (T4MO), Pseudomonas putida mt-2, E. coli
heterologously expressing toluene/o-xylene monooxygenase (ToMO), or sterile abiotic control.

For specific detection of dioxane transformation intermediates, Fourier transform ion
cyclotron resonance-mass spectrometry and direct infusion MS-MS were used. The
post-hydroxylation, post-ether-cleavage metabolite 2-hydroxyethoxyacetic acid (HEAA) was detected
transiently in samples from all dioxane-degrading strains. HEAA was subsequently degraded
into a mixture of linear dihydroxy-substituted ethoxyacetic acids. Ethylene glycol, glycolic acid,
glyoxylic acid, and oxalic acid were also detected.

On the basis of the intermediates identified during dioxane degradation by
monooxygenase-expressing bacteria, a complete biodegradation pathway was proposed (Fig. 3.3).

Figure 3.3. Observed biodegradation pathway of dioxane by monooxygenase-expressing bacteria in
this study. All degradation products shown in this pathway were identified using a combination of
analytical techniques.

The generation of CO2 from 14C-labeled dioxane by strains performing co-metabolic
dioxane degradation was a surprising result. In a separate study using the dioxane co-metabolic
degrader Pseudonocardia strain ENV478, Vainberg et al. (161) detected HEAA as the terminal
product of dioxane degradation, and mineralization to CO2 was not found.

Since dioxane contamination often co-occurs in groundwater with chlorinated solvents,
particularly 1,1,1-trichloroethane (TCA) and 1,1-dichloroethene (DCE), we explored how these
chlorinated co-contaminants affect bacterial dioxane degradation. Strain CB1190 was pre-grown
on dioxane, and then dioxane degradation by washed cell suspensions was tested in the presence
of varying concentrations of TCA or DCE. Strain CB1190 cells did not degrade either chlorinated
solvent. For both TCA and DCE, rates of dioxane transformation decreased as the co-contaminant
concentration increased. The observed effects were best modeled as mixed inhibition. After
exposure to TCA or DCE and then subsequent washing and resuspension in fresh medium, strain
CB1190 cells degraded dioxane at rates similar to those for cells not exposed to chlorinated
solvent, indicating that the observed inhibitory effects were reversible.
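
As a rough illustration of what a mixed-inhibition model implies, the sketch below uses the standard textbook rate expression, in which the inhibitor both raises the apparent half-saturation constant and lowers the apparent maximum rate. The exact formulation fitted in this work is not given here, so the function form is an assumption. The parameter names (kmax, km, kic, kiu) and every numerical value are hypothetical placeholders. Only the TCA concentrations mirror those shown in Fig. 3.4.

```python
def mixed_inhibition_rate(s, i, kmax, km, kic, kiu):
    """Substrate transformation rate under mixed inhibition.

    s   : substrate (dioxane) concentration
    i   : inhibitor (co-contaminant) concentration
    kic : competitive inhibition constant (acts on the Km term)
    kiu : uncompetitive inhibition constant (acts on the S term)
    """
    return kmax * s / (km * (1.0 + i / kic) + s * (1.0 + i / kiu))

# Hypothetical parameters for illustration only (not fitted values from this study)
KMAX, KM, KIC, KIU = 1.0, 150.0, 1.0, 5.0
DIOXANE = 100.0  # mg/L, hypothetical

rates = {tca: mixed_inhibition_rate(DIOXANE, tca, KMAX, KM, KIC, KIU)
         for tca in (0.0, 0.1, 1.0, 10.0)}
for tca, rate in rates.items():
    print(f"TCA {tca:>4} mg/L -> relative rate {rate:.3f}")
```

With no inhibitor present the expression reduces to the Monod form, and the rate falls monotonically as the inhibitor concentration rises, which is the qualitative pattern described above.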


Figure 3.4. Degradation of dioxane by CB1190 in the presence of various concentrations of TCA.

Diamonds,  0  mg/L  TCA;  squares,  0.1  mg/L  TCA;  triangles,  1  mg/L  TCA;  circles,  10  mg/L  TCA.  The 
error  bars  represent  the  range  of  triplicates. 


We next tested whether TCA or DCE affects dioxane degradation by Pseudomonas mendocina
KR1 or E. coli heterologously expressing toluene-4-monooxygenase (E. coli T4MO). These
bacteria degrade dioxane co-metabolically after growth on toluene or LB broth, respectively. The
presence of either TCA or DCE negatively affected dioxane degradation by both strains, with the
rates of dioxane transformation decreasing as the initial co-contaminant concentration increased
(see Fig. 3.4 for strain CB1190 data with TCA). Unlike strain CB1190, strain KR1 and E.
coli T4MO degraded TCA and DCE simultaneously with dioxane transformation. Additionally,
unlike with strain CB1190, TCA and DCE appeared to irreversibly impair dioxane degradation
by strain KR1 and E. coli T4MO, since dioxane degradation rates were not restored once the
cells were washed and resuspended in fresh medium. The reason for this irreversible inhibition
is unclear.

Conclusion 

This work confirmed a role for monooxygenases in dioxane degradation by a number of
bacterial strains. Kinetics of dioxane degradation were determined, as was a pathway for the
complete mineralization of dioxane to CO2 by the dioxane-metabolizing bacterium
Pseudonocardia dioxanivorans strain CB1190. The biodegradation of dioxane via this pathway
is unlikely to cause accumulation of toxic byproducts in the environment. Additionally, the
inhibition of dioxane degradation by chlorinated solvent co-contaminants indicates that these
co-contaminants must be considered when designing bioremediation strategies for removing
dioxane from polluted groundwater systems.


4. Sequencing and analysis of the genome of the dioxane-degrading
actinomycete Pseudonocardia dioxanivorans strain CB1190

This  is  a  summary  of  work  published  in: 

1. Sales, C. M., S. Mahendra, A. Grostern, R. E. Parales, L. A. Goodwin, T. Woyke, M.
Nolan, A. Lapidus, O. Chertkov, G. Ovchinnikova, A. Sczyrba, and L. Alvarez-Cohen.
2011. Genome sequence of the 1,4-dioxane-degrading Pseudonocardia dioxanivorans strain
CB1190. J. Bacteriol. 193:4549-4550.

2. Sales, C. M. 2012. Functional genomics of bacterial degradation of the emerging
contaminants 1,4-dioxane and N-nitrosodimethylamine (NDMA). Ph.D. thesis. University of
California, Berkeley.

3.  Additional  results  presented  here  are  in  publications  in  preparation. 

Introduction 

The  cyclic  ether  dioxane  is  increasingly  found  to  contaminate  groundwater.  The  potential 
toxicity  of  dioxane  has  motivated  research  into  how  to  remove  the  contaminant  from 
groundwater.  One  potential  method  is  bioremediation. 

Although anaerobic biodegradation of dioxane has been observed with sludge enriched
with iron-reducing bacteria (139), most reports of biodegradation of dioxane by fungi and
bacteria have involved aerobic conditions. Several bacterial and fungal isolates have
exhibited the ability to degrade dioxane aerobically (13, 19, 66, 89, 101, 112, 125, 127, 141, 161,
180). However, only five of those isolates are capable of growing on dioxane as a sole carbon
and energy source. These include four bacterial strains, Rhodococcus ruber 219 (13),
Pseudonocardia dioxanivorans CB1190 (112), Pseudonocardia benzenivorans B5 (89), and
Mycobacterium sp. PH-06 (66), and one fungal species, Cordyceps sinensis (101).

Described here is the genome sequence of the Gram-positive actinomycete P.
dioxanivorans strain CB1190. It is the first publicly available closed genome
sequence for the genus Pseudonocardia and, at 73.12%, has one of the
highest G+C contents of any strain sequenced to date. Although 454-pyrosequencing data are publicly
available for Pseudonocardia sp. P1 (GenBank accession ADUJ00000000), which was isolated
from the microbial community associated with colonies of the leaf-cutting ant species
Acromyrmex octospinosus (8), its assembly is fragmented, with 975 contigs. Members of the
Pseudonocardia genus have been isolated from soils polluted with aromatic chlorinated
compounds (62) and from glacial soils in the McMurdo Dry Valley in Antarctica (120).
Pseudonocardia species have also been identified as endosymbionts living in the tissue of
Chinese medicinal plants (23, 43, 124) and as mutualistic ectosymbionts with leaf-cutting ants
and their fungal gardens (20, 119). Strains of Pseudonocardia also have industrial and
environmental uses. For instance, strains of P. autotrophica produce polyene antifungal
antibiotics (82), while P. tetrahydrofuranoxydans and P. chloroethenivorans degrade the
environmental pollutants tetrahydrofuran (THF) and chloroethenes, respectively (62, 68, 83).

P. dioxanivorans strain CB1190 was isolated from a dioxane-contaminated industrial
sludge in Darlington, South Carolina (112). In addition to growth on dioxane, strain CB1190 can
grow on the ethers 1,3-dioxane, THF, tetrahydropyran, 2-methyl-1,3-dioxolane, butyl methyl
ether, diethyl ether, and the aromatic compounds benzene and toluene (90, 112). Strain CB1190
also grows autotrophically on hydrogen and carbon dioxide (112), and it can fix nitrogen gas
(90). Even though degradation intermediates have been identified and biochemical assays have
suggested the involvement of monooxygenase enzymes (89, 91), the genes that encode the
enzymes in the strain CB1190 dioxane metabolic pathways are unknown. The genome sequence
described here will allow us to better understand dioxane metabolism, as well as explore the
functional capabilities of P. dioxanivorans as a toxic pollutant degrader. Furthermore, the
genome sequence will provide the first insight into the genetics of the genus
Pseudonocardia in comparison to the genomes of medically and industrially important members
of the family Pseudonocardiaceae, which include the erythromycin-producing
Saccharopolyspora erythraea NRRL2338 (108), the natural pesticide-producing
Saccharopolyspora spinosa NRRL18395 (111), the rifamycin-producing Amycolatopsis
mediterranei U32 (183) and Amycolatopsis mediterranei S699 (164), and the respiratory
disease-causing Saccharomonospora viridis DSM 43017 (114).

Materials  and  Methods 

Culture  conditions 

Cells of strain CB1190 used for genomic DNA extraction were grown in ammonium
mineral salts (AMS) liquid medium (112). Filter-sterilized dioxane was added to the culture
medium and cultures were incubated aerobically while shaking at 30°C.

Genomic  DNA  extraction 

Large quantities of high molecular weight genomic DNA from strain CB1190 were
isolated by modifying the CTAB DNA isolation method from the Department of Energy Joint
Genome Institute (JGI). Genomic DNA was extracted from cells of strain CB1190 collected by
filtration. The cells were scraped from the filter and resuspended in 740 µL of TE buffer (10 mM
Tris, 1 mM EDTA, pH 8.0) in microcentrifuge tubes. To lyse the cells, the cell
suspensions were gently mixed with 20 µL of lysozyme (100 mg/mL) and incubated at room
temperature for 20 min. Then, 40 µL of 10% SDS (sodium dodecyl sulfate) and 8 µL of
Proteinase K (10 mg/mL) were added to each tube and incubated for 12 h at 37°C. Next, 100 µL
of 5 M NaCl and 100 µL of CTAB/NaCl heated to 65°C were added to each tube and incubated
at 65°C for 10 min. The CTAB/NaCl mixture was prepared by dissolving 4.1 g of NaCl and 10 g
of CTAB (hexadecyltrimethylammonium bromide) in 100 mL of DI water with constant mixing at
65°C for 3 h, followed by sterilization by autoclaving. Following incubation with CTAB, the resultant
lysate was gently mixed with 0.5 mL of chloroform:isoamyl alcohol (24:1) and then centrifuged
at maximum speed (21,000 x g) for 10 min at room temperature. The upper aqueous
phase was transferred to a clean microcentrifuge tube and gently mixed with 0.5 mL of
phenol:chloroform:isoamyl alcohol (25:24:1) (pH 8.0) and then centrifuged for 10 min at
maximum speed at room temperature. The upper aqueous phase was transferred to a clean
microcentrifuge tube and then mixed with 0.6 volumes of cold isopropanol (-20°C) and
incubated at room temperature for 30 min to precipitate the genomic DNA. The genomic DNA
was then pelleted by spinning each tube at maximum speed (21,000 x g) for 15 min. The pellets
were washed with 500 µL of 70% ethanol. After the supernatant was discarded, the pellet was
dried at room temperature for 10 min. Finally, each pellet was resuspended in 20 µL of TE buffer
plus RNase (final concentration of 0.1 mg/mL). The size, quantity, and quality of the isolated
high molecular weight genomic DNA were determined on a 1% agarose gel.
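
For readers preparing the CTAB/NaCl mixture, the stated masses can be converted to nominal concentrations with a short calculation. This is a sketch under two assumptions: the final volume is taken as 100 mL (the text says the solids were dissolved in 100 mL of water, so the true final volume is slightly larger), and the molar mass of NaCl is taken as 58.44 g/mol.

```python
# Nominal concentrations of the CTAB/NaCl mixture from the masses given in the text.
# Assumption: final volume is approximated as 100 mL.
NACL_MOLAR_MASS = 58.44  # g/mol

nacl_g = 4.1
ctab_g = 10.0
volume_ml = 100.0

nacl_molar = nacl_g / NACL_MOLAR_MASS / (volume_ml / 1000.0)  # mol/L
ctab_pct_wv = ctab_g / volume_ml * 100.0                      # % (w/v)

print(f"NaCl: {nacl_molar:.2f} M, CTAB: {ctab_pct_wv:.0f}% (w/v)")
```

Under these assumptions the mixture is roughly 10% (w/v) CTAB in about 0.7 M NaCl.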

Genomic sequencing and assembly

The draft genome of P. dioxanivorans strain CB1190 was generated at the DOE Joint
Genome Institute (JGI) using a combination of Illumina (12) and 454 sequencing technologies
(92). For this genome we constructed and sequenced an Illumina GAii shotgun library, which
generated 33,134,275 reads totaling 1,192 Mb; a 454 Titanium standard library, which generated
1,211,248 reads; and paired-end 454 libraries with average insert sizes of 3, 10, and 18 kb,
which generated 210,350 reads, for a total of 504.6 Mb of 454 data. All general aspects of library
construction and sequencing performed at the JGI can be found at http://www.jgi.doe.gov/. The
initial draft assembly contained 283 contigs in 8 scaffolds. The 454 Titanium standard data and
the 454 paired-end data were assembled together with Newbler [Version 2.3]. The Newbler
consensus sequences were computationally shredded into 2 kb overlapping fake reads (shreds).
Illumina sequencing data were assembled with VELVET [Version 0.7.63] (182), and the
consensus sequence was computationally shredded into 1.5 kb overlapping shreds. We integrated
the 454 Newbler consensus shreds, the Illumina VELVET consensus shreds and the read pairs in
the 454 paired-end library using parallel phrap [Version SPS - 4.24] (High Performance
Software, LLC). The software Consed (38) was used in the following finishing process. Illumina
data were used to correct potential base errors and increase consensus quality using the software
Polisher developed at JGI (Alla Lapidus, unpublished). Possible mis-assemblies were corrected
using gapResolution (Cliff Han, unpublished), Dupfinisher (48), or sequencing cloned bridging
PCR fragments with subcloning. Gaps between contigs were closed by editing in Consed, by
PCR and by Bubble PCR (J-F Cheng, unpublished) primer walks. A total of 809 additional
reactions and 6 shatter libraries were necessary to close gaps and to raise the quality of the
finished sequence. The total size of the genome is 7,440,794 bp and the final assembly is based
on 285 Mb of 454 draft data, which provides an average 38.1x coverage of the genome, and 1,011
Mb of Illumina draft data, which provides an average 134.8x coverage of the genome. The
finished chromosome and plasmid sequences have been deposited to RefSeq and GenBank
databases with accession numbers NC_015312-4 and CP002593-8.
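
The "shredding" step, in which a consensus sequence is cut into overlapping fake reads, can be sketched as follows. This is an illustrative reimplementation, not the JGI tool. The function name and the overlap length are assumptions, since the text gives only the shred sizes (2 kb and 1.5 kb). A toy sequence is used so the behavior is easy to follow.

```python
def shred(seq, size, overlap):
    """Cut seq into fragments of length `size`; each fragment starts
    `size - overlap` bases after the previous one, so neighbors share
    `overlap` bases. The last fragment may be shorter than `size`."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be >= 0 and smaller than size")
    step = size - overlap
    shreds = []
    start = 0
    while start < len(seq):
        shreds.append(seq[start:start + size])
        if start + size >= len(seq):
            break  # this fragment already reaches the end of the sequence
        start += step
    return shreds

# Toy example; a real consensus would use size=2000 and a chosen overlap (hypothetical)
toy = "ACGTACGTAC"
pieces = shred(toy, size=4, overlap=2)
print(pieces)

# The original sequence can be rebuilt from the first shred plus the
# non-overlapping tail of every following shred.
rebuilt = pieces[0] + "".join(p[2:] for p in pieces[1:])
```

Overlapping shreds let an assembler such as phrap treat a consensus from one platform as if it were a set of long reads that can be merged with data from the other platform.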

Genome  annotation  and  analysis 

Annotation of the genome of strain CB1190 was accomplished using the Oak Ridge
National Laboratory genome annotation pipeline. Genes were identified using Prodigal (55),
followed by a round of manual curation using the JGI GenePRIMP pipeline (113). The predicted
coding sequences (CDSs) were translated and used to search the National Center for
Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, PRIAM,
KEGG, COG, and InterPro databases. These data sources were combined to assert a potential
description for each predicted protein. Non-coding genes and miscellaneous features were
predicted using tRNAscan-SE (87), RNAmmer (78), Rfam (42), TMHMM (76), and signalP
(10).

Genes involved in transport systems were identified by analyzing putative protein
sequences against the Transport Classification Database (http://www.tcdb.org) (156), using the
basic local alignment search tool for proteins (blastp) (e-value < 1e-5) (4). Signal transduction
proteins were found using COG assignments of CDSs and the Microbial Signal Transduction
database MiST2.1 (http://mistdb.com) (159). Protein domains were identified using the NCBI
Conserved Domain Database (CDD) (http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml) or
MiST2.1. Genomic islands were identified by SIGI-HMM analysis and visualized using
IslandViewer (http://www.pathogenomics.sfu.ca/islandviewer) (79).
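
The e-value cutoff applied to the blastp searches can be illustrated with a small filter. The sketch assumes that hits are available in the common 12-column tab-separated BLAST tabular format, in which the e-value is the 11th column. The output format actually used in this work is not stated. The record identifiers below are invented for illustration.

```python
def filter_hits(lines, max_evalue=1e-5):
    """Return (query, subject, evalue) tuples for hits below the e-value cutoff."""
    hits = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        evalue = float(fields[10])  # 11th column of the 12-column tabular format
        if evalue < max_evalue:
            hits.append((fields[0], fields[1], evalue))
    return hits

# Hypothetical tabular blastp records (identifiers invented for illustration)
example = [
    "cds_0001\ttransporter_A\t45.2\t300\t150\t5\t1\t300\t10\t305\t2e-40\t160",
    "cds_0002\ttransporter_B\t28.0\t90\t60\t3\t5\t94\t20\t108\t0.003\t35.1",
    "cds_0003\ttransporter_C\t31.5\t120\t80\t2\t1\t120\t1\t118\t1e-5\t48.0",
]
kept = filter_hits(example)
print(kept)
```

Because the cutoff is strict, a hit with an e-value exactly equal to 1e-5 is excluded along with weaker hits.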

Results


Genome properties and features of P. dioxanivorans strain CB1190

The genome of strain CB1190 has a total size of 7,440,794 bp and consists of four
replicons: a circular chromosome (7,096,571 bp), a circular plasmid pPSED01 (192,355 bp), an
unclosed circular plasmid pPSED02 (136,805 bp), and a linear plasmid pPSED03 (15,063 bp)
(Table 4.1). The unclosed circular plasmid pPSED02 comprises three contigs, with sizes of
24,346 bp, 45,552 bp, and 66,907 bp (GenBank accession numbers CP002595-7), respectively.
The average G+C content of the entire genome is 73.12%. The G+C contents of the plasmids are
lower than that of the chromosome, with values of 73.41%, 71.15%, 68.38% and 61.83% for the
chromosome and plasmids pPSED01, pPSED02, and pPSED03, respectively.
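
For reference, G+C content is the percentage of G and C bases among all unambiguous bases in a sequence. The helper below is a minimal sketch of that calculation on a toy sequence. The function name is hypothetical, and ignoring ambiguous bases such as N is an assumption about how they are handled.

```python
def gc_content(seq):
    """Percent G+C among unambiguous bases (A, C, G, T); other symbols are ignored."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    return 100.0 * gc / acgt if acgt else 0.0

# Toy sequence, not taken from the CB1190 genome
toy_gc = gc_content("GGCCAT")
print(f"{toy_gc:.2f}%")
```

Applied to each replicon sequence in turn, the same calculation would yield per-replicon values of the kind reported in Table 4.1.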

The genome of strain CB1190 contains 6,797 protein-coding sequences (CDS) and 226
pseudogenes. The distribution of CDS, pseudogenes, and hypothetical proteins for each
replicon is shown in Table 4.1. The average length of the CDSs across the genome is 963 bp.
The chromosome contains 6,495 CDS and 194 pseudogenes while plasmid pPSED01 contains
172 CDS and 20 pseudogenes, plasmid pPSED02 contains 116 CDS and 11 pseudogenes, and
plasmid pPSED03 contains 14 CDS and no pseudogenes. Of all predicted proteins in the
genome, 4,955 (72.90%) have a putative function ascribed to them while 1,842 genes (27.10%)
are annotated as hypothetical proteins. Interestingly, only 3 of the 14 genes (21.4%) on pPSED03
have a putative function ascribed to them.


Table 4.1. Genome features of P. dioxanivorans strain CB1190.

Feature                 Genome         Chromosome     Plasmid pPSED01   Plasmid pPSED02   Plasmid pPSED03
Topology                               Circular       Circular          Circular          Linear
Length                  7,440,794 bp   7,096,571 bp   192,355 bp        136,805 bp        15,063 bp
G+C content             73.12%         73.41%         71.15%            68.38%            61.83%
Coding density          87.2%          88.5%          76.1%             80.0%             69.2%
Coding sequences        6,797          6,495          172               116               14
Pseudogenes             226            194            20                11                0
Average CDS length      963 bp         967 bp         946 bp            851 bp            744 bp
rRNAs                   3              3
tRNAs                   47             47
Hypothetical proteins   1,842          1,692          88                51                11
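
The per-replicon entries in Table 4.1 can be cross-checked against the genome-wide totals with a few lines of arithmetic. The sketch below uses the lengths, CDS counts, and hypothetical-protein counts from the table. For pPSED03 it uses the length given in the text (15,063 bp), and the dictionary layout is simply a convenient assumption.

```python
# Per-replicon values transcribed from Table 4.1
# (pPSED03 length as given in the text: 15,063 bp)
replicons = {
    "chromosome": {"length": 7_096_571, "cds": 6_495, "hypothetical": 1_692},
    "pPSED01": {"length": 192_355, "cds": 172, "hypothetical": 88},
    "pPSED02": {"length": 136_805, "cds": 116, "hypothetical": 51},
    "pPSED03": {"length": 15_063, "cds": 14, "hypothetical": 11},
}

totals = {
    key: sum(rep[key] for rep in replicons.values())
    for key in ("length", "cds", "hypothetical")
}

# Share of CDSs with and without a putative function, genome-wide
with_function = totals["cds"] - totals["hypothetical"]
pct_with_function = 100.0 * with_function / totals["cds"]
pct_hypothetical = 100.0 * totals["hypothetical"] / totals["cds"]

print(totals)
print(f"{with_function} CDS with putative function ({pct_with_function:.2f}%)")
print(f"{totals['hypothetical']} hypothetical proteins ({pct_hypothetical:.2f}%)")
```

The replicon lengths sum to the reported genome size of 7,440,794 bp, and the CDS and hypothetical-protein counts sum to the genome-wide totals of 6,797 and 1,842.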

A total of 59 RNA-encoding sequences were identified in the genome and all are located
on the chromosome. Strain CB1190 contains three sets of 16S, 23S, and 5S ribosomal RNA
(rRNA) genes, with one set located on the forward strand and two on the reverse strand. Of
the structural RNA sequences, 47 encode transfer RNA (tRNA) genes while the three remaining
RNAs encode a putative catalytic subunit of RNase P (rnpB), a 4.5S RNA component of the
signal recognition particle protein translocation system (ffs), and a small stable RNA (ssrA), which
is involved in a process called trans-translation (64, 65).

Comparative  genomics  of  Pseudonocardiaceae  family 

There are currently eleven bacterial strains in the family Pseudonocardiaceae for which
genome sequence data are publicly available: Amycolatopsis mediterranei S699, Amycolatopsis
mediterranei U32, Pseudonocardia dioxanivorans CB1190 (this study), Pseudonocardia sp. P1,
Pseudonocardia sp. P2, Saccharomonospora azurea NA-128, Saccharomonospora
paurometabolica YIM 9007, Saccharomonospora viridis DSM43017, Saccharopolyspora
erythraea NRRL2338, Saccharopolyspora spinosa NRRL18395, and Thermobispora bispora
DSM43833. However, recent phylogenetic analysis of the 16S rRNA sequences in the genome
of T. bispora DSM43833 shows that it belongs to the suborder Streptosporangineae (Liolios et
al., 2010). Both the suborder Streptosporangineae and the family Pseudonocardiaceae fall
within the order Actinomycetales. Further comparative genomic studies, reported herein,
support the conclusion that T. bispora does not belong to the family Pseudonocardiaceae. While
most members of the family Pseudonocardiaceae are Gram-positive, S. viridis is a
Gram-negative bacterium (114). Of further note, since two groups have sequenced the genome of S.
erythraea NRRL2338, here the genome sequenced by Oliynyk et al. (108) will be referred to as
S. erythraea NRRL2338-1 and the genome sequenced by the Italian Ministry of Research
(GenBank ABFV00000000) as S. erythraea NRRL2338-2. The genome of S. erythraea
NRRL2338-1 is closed, while the genome of S. erythraea NRRL2338-2 is comprised of 241
contigs.

Table 4.2 contains a detailed list comparing all genome features of each of the sequenced
Pseudonocardiaceae strains, including T. bispora as an out-group (Pseudonocardia sp. P2 is left
out because of the lack of annotation data). The genomes of the A. mediterranei strains were
originally sequenced because they are known to produce the antibiotic rifamycin. The two
sequenced strains differ in the type of rifamycin that they produce: A. mediterranei S699
produces rifamycin B, while strain U32 produces rifamycin SV (164). The complete genome of
A. mediterranei S699 is the largest in the family Pseudonocardiaceae, consisting of a circular
chromosome with a total length of 10,236,779 bp. The genome of A. mediterranei S699 was
assembled by mapping sequencing reads to the genome of A. mediterranei U32 as a reference
(164). The genome of A. mediterranei U32 also contains a single circular chromosome. The
smallest of the sequenced genomes in the family Pseudonocardiaceae is that of S. viridis, with a
total length of 4,308,349 bp. Although only draft sequences are available, the other two
Saccharomonospora species have similar genome sizes (4,770,125 bp and 4,592,308 bp,
respectively). The two sequences for the genome of S. erythraea NRRL2338 vary because the
genome of S. erythraea NRRL2338-1 is closed, while the genome of S. erythraea NRRL2338-2
is a draft. As shown in Table 4.2, the draft genome of S. spinosa is larger than the genome
sequences for S. erythraea. In addition to strain CB1190, the only Pseudonocardia species with
genome sequencing data are Pseudonocardia sp. P1 and Pseudonocardia sp. P2; however, their
genomes are poorly assembled. Whole genome shotgun sequencing data are available for
Pseudonocardia sp. P2 (GenBank AEGE00000000), but the sequence has not been annotated and
contains an extremely large number of contigs. Currently, the genome of Pseudonocardia sp. P2
has been assembled into 1,778 contigs consisting of 8.7 Mb.

Table 4.2. Features of sequenced Pseudonocardia genomes

Strain                   Status    Topology  Genome      GC       Coding   CDSs(c)  Pseudo-  Hypothetical    Plasmids(b)        rRNA     tRNA   Contigs
                                             size (bp)   content  density           genes    genes                              operons  genes
P. dioxanivorans         Finished  Circular  7,096,571   73%      88%      6,797    226      1,842 (27.1%)   3, free replicons  3        47     NA(e)
A. mediterranei S699     Finished  Circular  10,236,779  71%      90%      9,575    ND(d)    3,173 (33.1%)   1, integrated      4        52     NA
A. mediterranei U32      Finished  Circular  10,236,715  71%      89%      9,228    ND       2,854 (30.9%)   2, integrated      4        52     NA
Pseudonocardia sp. P1    Draft     Unknown   6,388,771   73%      88%      6,620    ND       2,206 (33.3%)   Unknown            1        50     875
S. azurea                Draft     Unknown   4,770,125   70%      90%      4,543    ND       1,127 (24.9%)   Unknown            1        48     187
S. paurometabolica       Draft     Unknown   4,592,308   71%      89%      4,876    ND       1,271 (26.0%)   Unknown            1        47     878
S. viridis               Finished  Circular  4,308,349   67%      86%      3,828    78       980 (25.6%)     0                  4        53     NA
S. erythraea NRRL2338-1  Finished  Circular  8,212,805   71%      85%      7,197    1        2,204 (30.6%)   4, integrated      4        50     NA
S. erythraea NRRL2338-2  Draft     Unknown   8,079,083   71%      83%      7,305    ND       2,326 (31.8%)   Unknown            1        50     241
S. spinosa               Draft     Unknown   8,527,776   68%      80%      8,215    ND       3,339 (40.6%)   Unknown            1        50     118
T. bispora(a)            Finished  Circular  4,189,976   72%      83%      3,546    50       980 (27.6%)     0                  3        53     NA

All information is from GenBank and RefSeq databases (P. dioxanivorans, NC_015312-4 and CP002593-8; A. mediterranei S699, CP002896; A. mediterranei
U32, NC_014318; Pseudonocardia sp. P1, NZ_ADUJ00000000; S. azurea, NZ_AGIU00000000; S. paurometabolica, NZ_AGIT00000000; S. viridis,
NC_013159; S. erythraea NRRL2338-1, NC_009142; S. erythraea NRRL2338-2, NZ_ABFV00000000; S. spinosa, NZ_AEYC00000000; T. bispora,
NC_014165).

a T. bispora does not belong to the family Pseudonocardiaceae; it belongs to the suborder Streptosporangineae.

b Information regarding integrated plasmids in S. erythraea NRRL2338-1 and A. mediterranei U32 is from te Poele et al. (152) and Zhao et al. (183),
respectively.

c CDS - Protein coding sequence
d ND - Not determined
e NA - Not applicable

Calculation of the number of reciprocal best protein blast hits between each pair of
genomes (Table 4.3) reflects the phylogeny, classification, genome sizes, and completeness of
the genome assembly of the strains within the family Pseudonocardiaceae. Using a blastp
cutoff e-value of 1e-5 and a minimum coverage of 70% to determine reciprocal blast hits
between a pair of genomes, strain CB1190 is shown to share the most CDSs with the other
Pseudonocardia species. Even though Pseudonocardia sp. P1 has 1,518 more predicted
protein-encoding genes than Pseudonocardia sp. P2, they have nearly identical numbers of reciprocal
blast hits with strain CB1190 (3,251 and 3,242, respectively). Among all three Pseudonocardia
species, Pseudonocardia sp. P1 and Pseudonocardia sp. P2 are more closely related to each
other, sharing 4,210 common proteins. Among other genera, Pseudonocardia genomes are most
related to Amycolatopsis genomes, with 39-48% similarity. In agreement with its 16S rRNA
phylogenetic relationship, T. bispora shares the fewest protein-coding genes with
members of the family Pseudonocardiaceae (1,840 to 2,136). As expected, a high degree of
similarity is seen between genomes from the same species (95%-99% between the A. mediterranei
strains). Surprisingly, the two genomes from the same strain of S. erythraea appear to be less
similar (93%-96%), but this may be because the genome of S. erythraea NRRL2338-2 is
unfinished while the genome of S. erythraea NRRL2338-1 is closed.
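
The reciprocal best hit criterion can be sketched as follows. This is a minimal illustration, assuming that each blastp hit has already been parsed into a record with an e-value, a coverage fraction, and a bit score; the `Hit` record, the use of bit score to rank competing hits, the definition of coverage as the aligned fraction of the query, and the toy protein identifiers are all assumptions made for illustration and are not taken from the analysis pipeline used here.

```python
from typing import Dict, Iterable, NamedTuple, Set, Tuple


class Hit(NamedTuple):
    query: str       # protein ID in the query genome
    subject: str     # protein ID in the subject genome
    evalue: float    # blastp e-value
    coverage: float  # aligned fraction of the query, 0..1 (assumed definition)
    bitscore: float  # used to rank competing hits (assumption)


def best_hits(hits: Iterable[Hit], max_evalue: float = 1e-5,
              min_coverage: float = 0.70) -> Dict[str, str]:
    """Map each query to its top-scoring subject among hits passing both cutoffs."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        if hit.evalue > max_evalue or hit.coverage < min_coverage:
            continue
        current = best.get(hit.query)
        if current is None or hit.bitscore > current.bitscore:
            best[hit.query] = hit
    return {query: hit.subject for query, hit in best.items()}


def reciprocal_best_hits(a_vs_b: Iterable[Hit], b_vs_a: Iterable[Hit],
                         **cutoffs: float) -> Set[Tuple[str, str]]:
    """Return (a, b) pairs in which a and b are each other's best hit."""
    forward = best_hits(a_vs_b, **cutoffs)
    reverse = best_hits(b_vs_a, **cutoffs)
    return {(a, b) for a, b in forward.items() if reverse.get(b) == a}


def shared_percent(n_shared: int, n_cds: int) -> float:
    """Shared proteins as a percentage of one genome's coding sequences."""
    return 100.0 * n_shared / n_cds


# Toy hit lists with hypothetical protein IDs.
a_vs_b = [
    Hit("a1", "b1", 1e-50, 0.95, 300.0),
    Hit("a1", "b2", 1e-20, 0.90, 120.0),
    Hit("a2", "b2", 1e-30, 0.80, 200.0),
    Hit("a3", "b3", 1e-40, 0.50, 250.0),  # rejected: coverage below 70%
]
b_vs_a = [
    Hit("b1", "a1", 1e-48, 0.96, 295.0),
    Hit("b2", "a1", 1e-18, 0.85, 110.0),  # best hit of b2 is a1, not a2
    Hit("b3", "a3", 1e-40, 0.90, 250.0),
]
pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
print(sorted(pairs))
```

One plausible reading of the similarity percentages is the shared-protein count divided by the number of coding sequences in each genome of the pair. Under that assumption, `shared_percent(9137, 9575)` and `shared_percent(9137, 9228)` give roughly 95% and 99% for the two A. mediterranei genomes, in line with the range quoted above.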

Mobile  genetic  elements 

The strain CB1190 genome contains a large number of mobile genetic elements (MGEs)
located on the chromosome and two of the plasmids, pPSED01 and pPSED02. A total of 129
open reading frames (ORFs) encode transposases and integrases belonging to the insertion
sequence families IS3, IS4, IS21, IS111A and IS116. The majority of MGEs, 106 of them, are
located on the chromosome, while 13 and 9 of them are located on plasmids pPSED01 and
pPSED02, respectively.

Using SIGI-HMM, a number of genomic islands (GIs) were predicted on the
chromosome of strain CB1190. Two clusters of predicted GIs were located in the regions of
1.30-1.38 Mbp (Psed_1228 to Psed_1324) and 3.68-3.74 Mbp (Psed_3442 to Psed_3501). These
regions have a G+C content of 68.7% and 68.3%, respectively, which is lower than the G+C
content of the chromosome (73.41%). Both regions contain a number of transposase and
integrase genes, so they are likely the result of a horizontal gene transfer event. The first cluster
contains a number of genes encoding resistance to mercuric compounds. The
second cluster contains a number of transporter and chaperone genes that may help the cell to
overcome toxicity. The inclusion of a plasmid partitioning gene parB (Psed_3493) in the second
cluster suggests that this GI originated from a plasmid.
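
The G+C comparison between a candidate island and its host replicon reduces to a simple base count. The sketch below is a simplified illustration and not the SIGI-HMM method, which scores codon usage; the function names and the toy sequence are hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percent G+C among unambiguous bases (A, C, G, T) of a DNA sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * gc / total


def region_gc_offset(seq: str, start: int, end: int) -> float:
    """G+C percent of seq[start:end] minus the G+C percent of the whole sequence."""
    return gc_percent(seq[start:end]) - gc_percent(seq)


# Toy replicon: a G+C-rich background with a lower-G+C insert at positions 30-40.
toy = "GCGCGCGCGC" * 3 + "ATGCATGCAT" + "GCGCGCGCGC" * 3
print(f"whole sequence: {gc_percent(toy):.1f}% G+C")
print(f"insert offset:  {region_gc_offset(toy, 30, 40):+.1f} percentage points")
```

Applied to the values reported above, the two predicted island regions sit about 4.7 and 5.1 percentage points below the chromosomal G+C content of 73.41%.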

Signal  transduction  systems 

A total of 657 ORFs were determined to encode signal transduction proteins. These signal
transduction proteins allow cells to respond to environmental changes. According to the
Microbial  Signal  Transduction  Database  (MiST2.1)  analysis,  51  ORFs  were  predicted  to  encode 
putative  histidine  kinases  and  57  were  predicted  to  encode  putative  response  regulators.  Based  on 
co-localization,  37  two-component  regulatory  systems  were  identified.  These  two-component 
systems  consist  of  a  histidine  kinase  (HK)  and  a  response  regulator  (RR).  The  HK  senses  specific 
environmental  stimuli,  while  the  RR  controls  the  cellular  response  by  the  differential  expression 
of  certain  genes. 
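
The co-localization criterion used to pair histidine kinases with response regulators is not specified in the text. As a rough sketch, assuming that a pair is called when an HK gene and an RR gene are immediate neighbors separated by no more than a chosen gap, the pairing could be expressed as follows. The `Gene` record, the 200 bp default gap, and the toy locus names are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Gene:
    locus: str
    start: int
    end: int
    kind: str  # "HK", "RR", or "other"


def pair_two_component(genes: List[Gene], max_gap: int = 200) -> List[Tuple[str, str]]:
    """Pair neighboring HK and RR genes that lie within max_gap bp of each other."""
    ordered = sorted(genes, key=lambda g: g.start)
    pairs: List[Tuple[str, str]] = []
    used = set()
    for left, right in zip(ordered, ordered[1:]):
        if left.locus in used or right.locus in used:
            continue  # each gene may belong to at most one pair
        is_hk_rr = {left.kind, right.kind} == {"HK", "RR"}
        if is_hk_rr and right.start - left.end <= max_gap:
            pairs.append((left.locus, right.locus))
            used.update((left.locus, right.locus))
    return pairs


toy_genes = [
    Gene("g1", 100, 1000, "HK"),
    Gene("g2", 1050, 1700, "RR"),     # adjacent to g1: paired
    Gene("g3", 2000, 2500, "other"),
    Gene("g4", 5000, 5600, "RR"),     # orphan: nearest HK is too far away
    Gene("g5", 9000, 10000, "HK"),
]
print(pair_two_component(toy_genes))
```

Genes left unpaired by such a rule would correspond to orphan kinases or regulators, which is consistent with the counts of 51 HKs and 57 RRs exceeding the 37 paired systems.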

Table 4.3. Reciprocal best blast hits between Pseudonocardiaceae.

The values indicate the number of shared proteins between a pair of genomes, based on a count
of all reciprocal best blastp hits between the pair, using an e-value cutoff of 1e-5 and a
minimum coverage of 70%.

                          P.dio    A.m.S699 A.m.U32  P.P1     P.P2     S.azu    S.pau    S.vir    S.ery-1  S.ery-2  S.spi    T.bis
P. dioxanivorans CB1190   -        3219     3209     3251     3242     2392     2368     2210     2842     2845     2789     1890
A. mediterranei S699      3219     -        9137     3035     3138     3109     2943     2846     3666     3665     3358     2136
A. mediterranei U32       3209     9137     -        3033     3135     3106     2939     2846     3657     3657     3346     2135
Pseudonocardia sp. P1     3251     3035     3033     -        4210     2408     2387     2241     2989     2990     2815     1840
Pseudonocardia sp. P2     3242     3138     3135     4210     -        2399     2387     2252     2950     2936     2813     1872
S. azurea                 2392     3109     3106     2408     2399     -        3043     3195     2901     2893     2660     1849
S. paurometabolica        2368     2943     2939     2387     2387     3043     -        2849     2738     2719     2568     1807
S. viridis                2210     2846     2846     2241     2252     3195     2849     -        2662     2656     2472     1785
S. erythraea NRRL2338-1   2842     3666     3657     2989     2950     2901     2738     2662     -        6901     4091     2024
S. erythraea NRRL2338-2   2845     3665     3657     2990     2936     2893     2719     2656     6901     -        4063     2022
S. spinosa                2789     3358     3346     2815     2813     2660     2568     2472     4091     4063     -        1955
T. bispora                1890     2136     2135     1840     1872     1849     1807     1785     2024     2022     1955     -

As is common in bacteria, one-component signal transduction systems are more abundant
in strain CB1190 than two-component systems. One-component systems consist of a single
protein that contains input (sensory) and output (regulatory) domains but lacks the phosphotransfer
domains (i.e., receiver (RR) and transmitter (HK) domains) typical of two-component systems
(158). A total of 502 genes encoding one-component signal transduction proteins were identified
that contain at least one conserved output domain. The output domains in one-component
systems are significantly more diverse than those in two-component systems, containing mostly
DNA-binding domains but also RNA-binding domains, phosphohydrolase (HD) domains, phosphatase
domains, protein kinase (Pkinase) domains, and nucleotide (adenylate and di-guanylate) cyclase
domains. Even among the DNA-binding domains, there is a higher degree of variety in
one-component systems than in two-component systems.

A third important class of bacterial signal transduction proteins is the extracytoplasmic
function (ECF) σ factors. All bacteria have a primary σ factor that controls basal-level expression
of most genes. Bacterial σ factors assist RNA polymerase in initiating transcription by allowing
it to bind to specific gene promoters (52). Alternative σ factors, such as ECF σ factors, are
activated in the presence of certain environmental stimuli (51). In the absence of a stimulus,
alternative σ factors are kept inactive through protein-protein binding with anti-σ factors (18). A
total of 50 ORFs encode putative RNA polymerase σ factors, and 34 of these contain
conserved ECF σ factor domains recognized by MiST2.1. Despite being unrecognized by
MiST2.1, the remaining σ factors were found to have output domains homologous to the
common σ70 factor domains.

Among Pseudonocardiaceae, strain CB1190 has fewer signal transduction proteins than
A. mediterranei U32 (1,141), S. erythraea NRRL2338-1 (738), and S. erythraea NRRL2338-2
(727), but more than S. viridis P101 (323) and Pseudonocardia sp. P1 (585). According to the
MiST2.1 database, one-component systems in all Pseudonocardiaceae make up approximately
70-80% of all signal transduction proteins. In addition, 84 proteins with conserved ECF σ factor
domains were found in A. mediterranei U32, 32 in S. erythraea NRRL2338-1, 17 in
Pseudonocardia sp. P1, and 13 in S. viridis.

Transport  systems 

The strain CB1190 genome contains many genes associated with transport. A total of 961
genes (13.7% of all CDS) encoding a broad range of transport functions were identified
according to blastp searches (e-value < 10e-5) against the Transport Classification Database. The
majority of the predicted transport genes were located on the chromosome, while only 21, 15,
and 1 were identified on the plasmids pPSED01, pPSED02, and pPSED03, respectively. Twenty-six
genes were predicted to encode transmembrane channels, including those involved in the
transport of potassium, ammonia, urea-amide, and magnesium-cobalt-nickel. A total of 230
genes were homologous to carriers that include specificity for sugars, aromatic compounds,
drugs, heavy metals, nitrate, nitrite, sulfate, amino acids, and nucleosides. Ninety-one of these
potential carriers were members of the Major Facilitator Superfamily (MFS). The largest group
of transporters was the ATP-binding Cassette (ABC) Superfamily, consisting of 283 genes or
29.4% of all transport-associated genes in the strain CB1190 genome. These ABC transporters
were predicted to be involved in P-P-bond-hydrolysis-driven transport of substrates such as
carbohydrates, heavy metals, drugs, liposaccharides, lipoproteins, nitrate, nitrite, bicarbonate,
organic acids, cholesterol, amino acids, urea, and oligopeptides.

A significant number of ABC (44) and MFS (45) transporters were predicted to be
involved in drug resistance or export of drugs. This accounts for 15% of ABC transporters and
35% of MFS transporters. Sixteen of the ABC transporter genes were found to be members of
the Drug Exporter-1 (DrugE1) Family. Three of the MFS transporter genes (Psed_0339,
Psed_4022, and Psed_6185) were identified as members of the Drug:H+ Antiporter-1 (DHA1)
Family, while 35% of all MFS transporter genes were members of the Drug:H+ Antiporter-2
(DHA2) Family.

The  strain  CB1190  genome  contains  a  number  of  genes  involved  in  heavy  metal 
resistance.  A  cluster  of  genes  (Psed_6718  to  Psed_6725)  is  homologous  to  the  mercury 
resistance  operon  of  Streptomyces  lividans  (134).  The  genome  of  strain  CB1190  also  contains 
two  alkylmercury  lyases  (Psed_1313  and  Psed_1314)  and  a  mercuric  reductase  (Psed_1317), 
which  are  involved  in  the  detoxification  of  mercury.  An  arsenate  resistance  operon,  containing 
genes  for  a  putative  arsenate  reductase  (Psed_2407),  an  arsenical  resistance  protein  (Psed_2408), 
and  an  arsenic-related  regulatory  protein  ArsR  (Psed_2409),  was  also  identified. 

Secretion systems

Genes for the general secretory pathway are present in the strain CB1190 genome. Unlike
E. coli but similar to Comamonas testosteroni CNB-2 (88), the Sec genes identified in strain
CB1190 were not clustered, except for secF, secD, and yajC (Psed_3580 to Psed_3582), which
encode a putative protein complex. The strain CB1190 genome also contains genes predicted to
be members of the Type III (IIISP) and Type IV (IVSP) secretory pathways, which are typically
involved in the transport of virulence factors. Three type IV secretory pathway genes are located
on plasmid pPSED02 (Psed_6900, Psed_6901, and Psed_6994).

Chaperone  proteins 

The genome of strain CB1190 contains 51 chaperone genes. Two genes (Psed_6730 and
Psed_6374) were annotated as hypC/hupF, chaperones involved in the assembly of
hydrogenases. A gene annotated as a nitrite reductase molybdenum cofactor assembly chaperone
(Psed_4302) was also identified. A number of genes were also found to encode heat
shock and cold shock proteins. Similar to Rhodococcus jostii RHA1 (137) and Gordonia sp.
TY-5 (72), a chaperonin GroEL (Psed_0633) is found within a cluster of genes that encode a
putative propane monooxygenase.

Nitrogen  metabolism 

Strain CB1190 can use multiple nitrogen sources (90), and this is reflected in the genome
annotation. Ammonia in AMS medium can be incorporated by homologues of glutamine
synthetase (Psed_2546, Psed_3968, Psed_4574, and Psed_4917), which condenses ammonia
with glutamate to form glutamine. Strain CB1190 can use nitrate, and a total of eight ORFs
encode nitrate reductases, which convert nitrate to nitrite. One of these nitrate reductase genes
(Psed_1088) is in a cluster of genes that encode subunits of a nitrite reductase (Psed_1089 to
Psed_1091), a nitrite transporter (Psed_1087), and a uroporphyrinogen-III synthase HEM4
(Psed_1092). Based on blastp analyses, the reductases in this gene cluster are most similar to
assimilatory NAD(P)H-dependent nitrate and nitrite reductases in Amycolatopsis mediterranei
U32, which function to reduce nitrate to NH3 via NO2- and hydroxylamine. In strain CB1190, the
uroporphyrinogen-III synthase catalyzes the synthesis of uroporphyrinogen-III, a precursor for
siroheme, which is a cofactor for nitrite reductase (135). In addition to being able to assimilate
NO3-, strain CB1190 also appears capable of dissimilatory nitrate reduction. The presence of a
cluster of four ORFs encoding subunits of a respiratory nitrate reductase (Psed_4300 to
Psed_4303) suggests the ability to use NO3- as a terminal electron acceptor under anaerobic
conditions. However, genes for respiration of NO2- are absent, such as the nrfA-encoded cytochrome
c nitrite reductase, which reduces NO2- to ammonia, or the nirS-encoded cytochrome cd1 nitrite
reductase or the nirK-encoded copper nitrite reductase, which can each reduce NO2- to nitric
oxide (NO). Rather, the NO2- is reduced to NH4+ by either assimilatory or dissimilatory
mechanisms without producing the proton-motive force that would be generated during NO2-
respiration. Furthermore, nitric oxide and nitrous oxide reductases are missing in the genome,
meaning NO3- cannot be reduced all the way to dinitrogen (N2). However, genes are present for
nitric oxide dioxygenases (Psed_4325, Psed_5584, and Psed_6551), which may allow strain
CB1190 to cope with nitric oxide stress by oxidizing it to nitrate.

It was previously shown that strain CB1190 can fix dinitrogen (N2) (90), an activity that
has not been reported for other Pseudonocardia. Evidence for this activity included the
conversion of acetylene to ethylene (a common activity of nitrogenases) and the production of
ammonia by cells growing in mineral media void of nitrogen sources (i.e., NH3, NO3-, or NO2-).
However, no homologues to nitrogenase genes were found in the genome of strain CB1190,
except for two genes annotated as nifU-like proteins (Psed_3394 and Psed_6367) and one as a
nifC-like porter (Psed_5656). Many organisms that are unable to fix dinitrogen contain
nifU-like proteins because they can assist in the formation of metalloclusters, typically FeS clusters,
for proteins other than nitrogenases. Similarly, nifC-like porters, which are involved in the
transport of molybdenum for the molybdenum-containing nitrogenase enzymes, could transport
molybdenum for other enzymes or cofactors. While a unique system called
superoxide-dependent N2 (sdn) fixation was described in Streptomyces thermoautotrophicus, in which N2
reduction to ammonium is coupled to oxidation of superoxide produced from O2 and CODH, the
short N-terminal polypeptide sequences for the CODH, the superoxide oxidoreductase, and the
novel oxygen-insensitive nitrogenase from S. thermoautotrophicus are not sufficient for
homology searches (126). Although Hoffmann-Findeklee et al. (2002) describe cloning and
sequencing of sdn genes from S. thermoautotrophicus and suggest that the nitrogenase belongs to
the molybdenum-hydroxylase family, neither nucleotide nor amino acid sequences are publicly
available. Therefore, the N2 fixation system used by strain CB1190 remains unknown.

General  carbon  metabolism 

The  ability  of  strain  CB1190  to  grow  on  mineral  medium  indicates  it  can  synthesize  all  of 
its  cellular  components,  including  fatty  acids,  nucleotides  and  amino  acids,  and  in  silico  analysis 
of  the  genome  supports  this  notion.  Complete  pathways  are  present  for  the  synthesis  of  22  amino 
acids, including the selenoamino acids selenocysteine and selenomethionine. Both the glycolysis
and pentose phosphate (PP) pathways are complete, suggesting that glucose can be metabolized
through  at  least  two  pathways.  In  addition,  the  entire  citric  acid  or  tricarboxylic  acid  (TCA)  cycle 
is  present. 

The genome sequence also suggests that strain CB1190 can metabolize xylose using the
isomerase pathway. The genome contains 11 putative xylose isomerase genes (Psed_0624,
Psed_1104, Psed_2264, Psed_2595, Psed_3711, Psed_3726, Psed_5635, Psed_6921, Psed_6931,
Psed_6943, and Psed_6944), which encode proteins that convert D-xylose into D-xylulose. Also
present are genes for two xylulokinases (Psed_1102 and Psed_3672) that can phosphorylate
D-xylulose into D-xylulose-5-phosphate, an intermediate in the PP pathway.

Carbon-fixation  pathways 

Strain CB1190 grows autotrophically by fixing CO2 in the presence of H2 (112). Six
primary CO2 fixation pathways have been identified in prokaryotes: 1) the
Calvin-Benson-Bassham (CBB) pathway, 2) the reductive TCA cycle or reverse citric acid cycle, 3) the
reductive acetyl-CoA pathway or Wood-Ljungdahl pathway, 4) the 3-hydroxypropionate
pathway/malyl-CoA pathway (3-HP), 5) the 3-hydroxypropionate/4-hydroxybutyrate cycle, and
6) the dicarboxylate/4-hydroxybutyrate cycle (77). A search of the strain CB1190 genome for
genes related to each of these pathways was conducted and revealed the presence of a
near-complete CBB pathway. Genes potentially contributing to the other five pathways were also
identified, but the CB1190 genome lacks homologues of enzymes catalyzing several steps in each
of these five pathways, so the potential activity of these pathways could not be inferred from
genome sequence information.
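
The pathway search described above amounts to comparing the enzymatic steps each pathway requires with the functions annotated in the genome. The sketch below illustrates that comparison only; the pathway names, step identifiers, and annotated set are placeholders and do not represent the actual CB1190 annotation or any curated pathway definition.

```python
from typing import Dict, Iterable, List


def missing_steps(required_steps: Iterable[str],
                  annotated_functions: Iterable[str]) -> List[str]:
    """Return the required steps that have no annotated counterpart, sorted."""
    return sorted(set(required_steps) - set(annotated_functions))


def pathway_report(pathways: Dict[str, List[str]],
                   annotated_functions: Iterable[str]) -> Dict[str, dict]:
    """Summarize, per pathway, which steps are missing and what fraction is present."""
    annotated = set(annotated_functions)
    report = {}
    for name, steps in pathways.items():
        unique_steps = set(steps)
        missing = missing_steps(unique_steps, annotated)
        report[name] = {
            "missing": missing,
            "complete": not missing,
            "fraction_present": 1 - len(missing) / len(unique_steps),
        }
    return report


# Placeholder inputs for illustration only.
toy_pathways = {
    "pathway_X": ["step_1", "step_2", "step_3", "step_4"],
    "pathway_Y": ["step_2", "step_5"],
}
toy_annotated = {"step_1", "step_2", "step_4", "step_5"}

for name, summary in pathway_report(toy_pathways, toy_annotated).items():
    print(name, summary)
```

A report of this kind shows only whether homologues are present. As noted above, it cannot establish whether an incomplete pathway is active.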

Monooxygenases

The first step in dioxane biodegradation by strain CB1190 is catalyzed by a
monooxygenase reaction ((89, 91) and Chapter 6). Monooxygenases are enzymes that use the
energy from NAD(P)H to reduce dioxygen (O2) into water and a hydroxyl group, which is
subsequently incorporated onto a substrate. A total of 84 ORFs are predicted to encode
monooxygenase components. Since monooxygenases catalyze a hydroxylation reaction, they are
often referred to as hydroxylases, and their component proteins are often annotated as such. The
genome of strain CB1190 contains 14 ORFs annotated as hydroxylases. In addition, 11 ORFs are
predicted to encode cytochrome P450 (CYP) enzymes, which are known to catalyze
monooxygenase reactions of a variety of organic compounds.

Eight gene clusters of bacterial multicomponent monooxygenases (BMMs) are present in
the strain CB1190 genome. These BMMs contain two to six co-located genes, including at least
a hydroxylase protein and a reductase protein. The reductase protein supplies electrons to the
active site of the hydroxylase component by oxidizing NAD(P)H. These electrons allow the
hydroxylase component to catalyze the hydroxylation of the substrate. The hydroxylase
component of BMMs may consist of two proteins: a large α subunit and a smaller β subunit.
Classification of BMMs is based upon their substrate specificity. The subunit composition and
arrangement of coding sequences differ for each type of BMM. Therefore, the order of
co-localized genes and their similarity to sequences of known BMM components were used to
categorize and deduce the function of the eight BMM clusters in strain CB1190 (see Figure 4.1).

Seven of the BMM clusters are located on the chromosome, including a propane
monooxygenase (prm) gene cluster (Psed_0629 to Psed_0632) and a putative phenol
hydroxylase (Psed_0770 to Psed_0766). Four chromosome-encoded BMMs are similar to
aromatic monooxygenases that target benzene, toluene, ethylbenzene, and xylene. The clusters
denoted as aromatic monooxygenase 1 (Psed_0815 to Psed_0810) and aromatic monooxygenase
3 (Psed_1436 to Psed_1441) in Figure 4.1 have identical tmoABDEC gene structures.
Furthermore, the corresponding proteins in aromatic monooxygenase 1 and aromatic
monooxygenase 3 share 46.6% to 81.6% identity. Similarly, the order of the genes in aromatic
monooxygenase 2 (Psed_1155 to Psed_1159) and aromatic monooxygenase 4 (Psed_6062 to
Psed_6058) is identical and is similar to that of the toluene/o-xylene
monooxygenase touABCDEF gene cluster found in Pseudomonas stutzeri OX1 (14) and the
toluene 4-monooxygenase tmoABCDEF gene cluster found in Pseudomonas mendocina KR1
(174, 175). The protein sequences in aromatic monooxygenase 2 and 4 are 37.8% to 50.0%
identical.
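
Comparing gene order between clusters on opposite strands requires reading each cluster in its direction of transcription. The sketch below illustrates that comparison under the assumption that a cluster listed with descending locus tags lies on the reverse strand. The single-letter gene labels and strand assignments are illustrative inputs, not the annotation itself.

```python
from typing import List, Sequence, Tuple

Cluster = Tuple[Sequence[str], str]  # (genes in chromosomal order, strand)


def transcription_order(genes_by_position: Sequence[str], strand: str) -> List[str]:
    """Return gene labels in the order in which they are transcribed."""
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    genes = list(genes_by_position)
    return genes if strand == "+" else genes[::-1]


def same_gene_order(cluster_a: Cluster, cluster_b: Cluster) -> bool:
    """True when two clusters have identical gene order along their transcripts."""
    return transcription_order(*cluster_a) == transcription_order(*cluster_b)


# Illustrative clusters: the second is the first one encoded on the opposite strand.
forward_cluster = (["A", "B", "D", "E", "C"], "+")
reverse_cluster = (["C", "E", "D", "B", "A"], "-")
other_cluster = (["A", "B", "C", "D", "E"], "+")

print(same_gene_order(forward_cluster, reverse_cluster))
print(same_gene_order(forward_cluster, other_cluster))
```

Order alone does not establish function, which is why the classification described above also relied on sequence similarity to known BMM components.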

The simplest BMM found is a two-component 4-hydroxyphenylacetate
3-monooxygenase encoded by Psed_6066-67. This monooxygenase is a flavin adenine
dinucleotide (FAD)-dependent monooxygenase. The large component HpaB (Psed_6066)
utilizes FADH2 to oxidize 4-hydroxyphenylacetate (HPA) (122). The FADH2 required for
hydroxylation is produced by the small component HpaC (Psed_6067), a flavin:NAD(P)H reductase,
which oxidizes NAD(P)H in order to reduce FAD (40).

Figure 4.1. Bacterial multicomponent monooxygenases (BMMs) in strain CB1190. Seven of the
BMMs (A-G) are located on the chromosome, while the tetrahydrofuran monooxygenase
(THFMO) is located on the plasmid pPSED02. prmA, propane monooxygenase (PrMO)
hydroxylase large subunit; prmB, PrMO reductase; prmC, PrMO hydroxylase small subunit;
prmD, PrMO coupling protein; poxB/dmpL, phenol hydroxylase P1 protein; poxC/dmpM, phenol
hydroxylase P2 protein; poxD/dmpN, phenol hydroxylase P3 protein; poxE/dmpO, phenol
hydroxylase P4 protein; poxF/dmpP, phenol hydroxylase P5 protein; tmoA/touA, toluene
monooxygenase (TMO) hydroxylase α subunit; tmoB/touB, TMO hydroxylase β subunit;
tmoC/touC, TMO effector protein; tmoD/touD, TMO protein D; tmoE/touE, TMO protein E;
HpaB, 4-hydroxyphenylacetate-3-hydroxylase oxygenase component; HpaC,
4-hydroxyphenylacetate-3-hydroxylase reductase component; thmA, THFMO oxygenase
component α subunit; thmB, THFMO oxygenase component β subunit; thmC, THFMO coupling
protein; thmD, THFMO reductase component. Note Psed_6976* is annotated as a pseudogene
(GenBank CP002597).

The only non-chromosomal BMM is a tetrahydrofuran (THF) monooxygenase
(Psed_6976 to Psed_6979), which is located on plasmid pPSED02 within a cluster of nine
genes (Psed_6974 to Psed_6982). Eight of the genes are highly homologous to tetrahydrofuran
monooxygenase (thm) gene clusters in Pseudonocardia sp. ENV478, P. tetrahydrofuranoxydans
K1, and Rhodococcus sp. YYL (GenBank HQ99619.1, AJ296087.1, and EU732588.2,
respectively). The eight common thm genes are orfY, sad, thmA, thmD, thmB, thmC, orfZ, and
aldH. The one gene in this cluster found only in strain CB1190 encodes an NRAMP family
Mn2+/Fe2+ transporter (Psed_6982). The presence of this Fe2+ transporter is plausible given
the high similarity of THF monooxygenases to binuclear-iron-containing multicomponent
monooxygenases (153, 154). Sequence analysis indicates that the thm cluster in strain CB1190,
excluding the transporter gene, shares 96%, 94%, and 87% nucleotide identity with the clusters of
Pseudonocardia sp. ENV478, P. tetrahydrofuranoxydans, and Rhodococcus sp. YYL,
respectively. The discovery of the thm cluster on plasmid pPSED02 in strain CB1190 is
consistent with previous findings that the thm gene sequences in P. tetrahydrofuranoxydans have
a characteristically low G+C content and are located on a large plasmid (pPSK50)
(154). The thm proteins encoded on these plasmids are nearly identical, sharing 91.4 to 99.6%
similarity (Fig. 4.2). Furthermore, the GC content of the thm cluster in strain CB1190 is 61.3%,
while the GC content of the entire plasmid pPSED02 is 68.4%.
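
The GC comparison between the thm cluster and the whole plasmid is a simple base-composition calculation. The following is a minimal sketch of that calculation, not the tool used in the original work; the function name `gc_content` and the toy input sequence are hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    # Count only unambiguous bases so that N or gap characters do not dilute the result.
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Hypothetical toy sequence; real input would be the thm cluster or the full pPSED02 sequence.
print(round(gc_content("ATGCGCGGCCTA"), 1))
```

Applying the same function to a gene cluster and to its host replicon gives the two percentages compared in the text; a markedly lower value for the cluster is one common indicator of horizontally acquired DNA.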

Figure 4.2. THFMO gene clusters from strain CB1190 and P. tetrahydrofuranoxydans. orfY,
unknown function; sad, succinate semialdehyde dehydrogenase; thmA, THFMO α subunit; thmB,
THFMO β subunit; thmC, THFMO coupling protein; thmD, THFMO reductase; orfZ, unknown
function; aldH, aldehyde dehydrogenase. Yellow indicates monooxygenase large subunit, red
indicates reductase, green indicates monooxygenase small subunit, and blue indicates
monooxygenase coupling protein.


Dioxygenases 

The strain CB1190 genome contains 92 genes annotated as putative dioxygenase
components that catalyze the incorporation of both atoms of dioxygen into substrates as hydroxyl
groups. Many of these dioxygenases appear to be involved in the degradation of aromatic
compounds. Two of the dioxygenase genes are located on plasmid pPSED02: an extradiol
ring-cleavage dioxygenase class III protein subunit B (Psed_6935) and a phthalate 3,4-dioxygenase
ferredoxin subunit (Psed_6938). Half of the putative dioxygenases (46) are annotated as
glyoxalase/bleomycin resistance protein/dioxygenases and are distributed throughout the
chromosome. Some of the dioxygenases appear to be involved in the metabolism of aromatic
amino acids, such as phenylalanine with 3-carboxyethylcatechol 2,3-dioxygenase (Psed_0244)
and tyrosine with gentisate 1,2-dioxygenase protein (Psed_1337) and 4-hydroxyphenylpyruvate
dioxygenase (Psed_1410). Genes annotated to be involved in the degradation of the following
compounds were identified in the strain CB1190 genome: protocatechuate (Psed_1566 to
Psed_1564), catechol (Psed_3093 to Psed_3095), and vanillate (Psed_2465 to Psed_2466). A number
of genes suggesting that assimilation of benzoate may occur via benzoyl-CoA were discovered,
such as those annotated as benzoate-CoA ligase (Psed_1491), benzoyl-CoA-dihydrodiol lyase
(Psed_1490), and benzoyl-CoA oxygenase component B (Psed_1489).

A  few  ORFs  annotated  as  dioxygenases  are  co-localized  with  BMM  cluster  genes 
involved  in  hydroxylation  of  aromatic  compounds.  A  catechol  1,2-dioxygenase  gene 
(Psed_0764)  is  found  downstream  of  the  phenol  monooxygenase  gene  cluster  (Psed_0770  to 
Psed_0766). Another ORF annotated as a catechol 1,2-dioxygenase (Psed_6065) is located
immediately downstream of the HPA monooxygenase gene cluster (Psed_6067 to Psed_6066)
and a few genes upstream of the aromatic monooxygenase 4 gene cluster (Psed_6062 to
Psed_6058). The co-location of these BMMs and the catechol 1,2-dioxygenase genes is
consistent with the fact that the metabolism of aromatic compounds (such as toluene, phenol, and
HPA) typically involves monooxygenase reactions that produce dihydroxy aromatic compounds
(such as catechol, benzoate, and dihydroxyphenylacetate). These dihydroxy aromatic compounds
are substrates of ring-cleaving dioxygenase reactions, which produce intermediates that can enter
the  TCA  cycle  (27,  31). 

Discussion 

P. dioxanivorans strain CB1190 is the first bacterium in the genus Pseudonocardia with a
finished genome. In general, members of the family Pseudonocardiaceae are characterized
by a high GC content (67-73%) and a large genome (4.3-10.2 Mb). Among the
Pseudonocardiaceae genomes, strain CB1190 (along with Pseudonocardia sp. P1) has the
highest GC content (73%) but is intermediate in total genome size (7.1 Mb) and CDS
count (6,797). The Pseudonocardia genomes, in general, share a large number of
protein-encoding genes with the genomes of A. mediterranei (3,035-3,219 CDS).

The majority of the sequenced Pseudonocardiaceae genomes are from organisms of
industrial and medical importance and were sequenced primarily to identify the proteins involved
in the secondary metabolism of antibiotic production. In strain CB1190, the
only secondary metabolism-associated genes that we identified were genes encoding
antibiotic biosynthesis monooxygenases, beta-lactamases, polyketide cyclase/dehydrases, and
proteins necessary for the production of precursors (e.g., malonyl-CoA and methylmalonyl-CoA)
used in secondary metabolism.

The presence of a large number of genes for antibiotic resistance in the CB1190 genome
suggests that strain CB1190 evolved in a complex and competitive microbial environment with
antibiotic-producing organisms. Genes encoding a wide variety of drug efflux proteins, including
multidrug transport proteins and those putatively targeting antibiotics such as acriflavin,
actinorhodin, camphor, cephamycin, chloramphenicol, landomycin, methicillin, and rifamycin,
were identified. In addition to transport mechanisms for coping with antimicrobial
metabolites, half of the putative dioxygenase genes spread across the genome are related to the
bleomycin resistance protein, which could degrade antibiotics into less toxic products. The
genome also harbors genes for transport and signal transduction mechanisms to cope with heavy
metal toxicity. Therefore, in addition to dealing with a competitive microbial community, strain
CB1190 appears to be well suited to a hostile environment, as would be expected from its
isolation source: dioxane-contaminated sludge. However, the nature and extent of heavy metal
toxicity that strain CB1190 can withstand have not been studied. Characterizing the effect of
heavy metals on contaminant degradation by strain CB1190 could determine the feasibility of
bioremediation at dioxane sites contaminated with heavy metals.

Previous studies have implicated monooxygenases in catalyzing the first step of dioxane
degradation in strain CB1190 (89, 91) (see Chapters 5 and 6 for experimental evidence).
Monooxygenase-specific acetylene inhibition of THF and dioxane degradation by THF-grown
CB1190 and P. tetrahydrofuranoxydans demonstrated that a putative but biochemically
uncharacterized THF monooxygenase is involved in the degradation of dioxane by P.
tetrahydrofuranoxydans and suggested that strain CB1190 possesses a
homologous monooxygenase enzyme (91). The THF monooxygenase genes thmADBC, located
on a plasmid in P. tetrahydrofuranoxydans, encode a four-subunit enzyme homologous to
binuclear-iron-containing BMMs (154). Although a nearly identical thm gene cluster was found
on plasmid pPSED02, strain CB1190 contains seven additional BMMs that could potentially be
involved in dioxane degradation. The aromatic monooxygenases 1-4 (Fig. 4.1) share a high
degree of similarity with toluene monooxygenases. In other organisms, toluene-2-monooxygenase
(T2MO), toluene-p-monooxygenase (TpMO), and toluene-4-monooxygenase
(T4MO) were found to degrade dioxane, but toluene-o-xylene-monooxygenase (ToMO) was not (89).
Interestingly, aromatic monooxygenases 2 and 4 are similar in sequence and gene
order (tmoABCDE) to both T4MO in Pseudomonas mendocina KR1, which can degrade
dioxane, and ToMO in Pseudomonas stutzeri OX1, which cannot degrade dioxane. The gene
clusters of aromatic monooxygenases 1 and 3 also contain T4MO and ToMO homologues,
but the arrangement of genes in these clusters, tmoABDEC, is unique. In fact, the gene
neighborhoods downstream of aromatic monooxygenases 1 and 3 are highly similar, suggesting
a large duplication event of at least 5.9 kb in the genome. Toluene-grown P.
tetrahydrofuranoxydans also degrades dioxane, but the type or sequence of the toluene-induced
monooxygenase has not been determined (89). Although a propane-induced monooxygenase
degrades dioxane in Mycobacterium vaccae JOB5 (19), suggesting that the propane
monooxygenase identified in strain CB1190 could oxidize dioxane, we did not detect a propane
monooxygenase in M. vaccae JOB5 using degenerate oligonucleotide PCR primers (Chapter 2).
Therefore, a role for the strain CB1190 propane monooxygenase homologue in dioxane
degradation is unclear. Sequence analysis and the arrangement of their genes indicate that the
prm and thm clusters in strain CB1190 are closely related.

Conclusion 

The genome of Pseudonocardia dioxanivorans strain CB1190 represents the first
complete and annotated sequence of both a dioxane-metabolizing microorganism and a
representative of the genus Pseudonocardia. This annotation reveals the potential for strain
CB1190 to use a wide variety of organic substrates, as is typical for environmental
actinobacteria. The genome provides a foundation for future studies to explore the genetic and
biochemical basis of strain CB1190 physiology, such as dioxane and C2 carbon metabolism, as
explored in Chapters 5 and 7, respectively.


5. Transcriptional analysis of Pseudonocardia dioxanivorans strain CB1190 dioxane degradation


This  is  a  summary  of  work  published  in: 

1. Grostern, A., C.M. Sales, W.-Q. Zhuang, O. Erbilgin, and L. Alvarez-Cohen. 2012.
Glyoxylate metabolism is a key feature of the metabolic degradation of 1,4-dioxane by
Pseudonocardia dioxanivorans strain CB1190. Applied & Environmental Microbiology, in press.

2.  Sales,  C.  M.  2012.  Functional  genomics  of  bacterial  degradation  of  the  emerging 
contaminants  1,4-dioxane  and  N-nitrosodimethylamine  (NDMA).  Ph.D.  thesis.  University  of 
California,  Berkeley. 

3.  Additional  results  presented  here  are  in  publications  in  preparation. 

Introduction 

The cyclic ether dioxane is part of the growing class of emerging contaminants, whose wide
distribution in the environment has only recently been recognized. Dioxane is highly soluble in
water and thus is easily transported in groundwater systems from point source contamination
(181). Although the long-term effects of dioxane exposure on human health are not well
understood, animal studies indicate the compound is a potential carcinogen (146). Consequently,
dioxane contamination has become the focus of remediation research, and microbial systems
capable of dioxane degradation have been investigated (181). The degradation of dioxane under
co-metabolic conditions (where other substrates provide carbon and energy for growth) appears
to be somewhat common (19, 89, 101, 141, 148, 161), but an increasing number of
microorganisms have also been identified that can use dioxane as a sole source of carbon and
energy (13, 66, 89, 101, 112).

Initial investigations of biological dioxane transformation were performed as part of
toxicological studies, which identified hydroxyethoxyacetic acid (HEAA) and dioxane-2-one as
the major metabolites in humans (178) and rats (168), respectively. Recently, the pathways of
dioxane biotransformation have been studied in a number of metabolizing and co-metabolizing
bacteria and fungi in an attempt to understand the fate of environmentally released dioxane.
Similar to the findings of the toxicological studies, a common feature of dioxane transformation
is the initial hydroxylation of the dioxane ring by monooxygenase enzyme systems (89, 141,
161). The hydroxylated metabolite then undergoes uncharacterized biotic or abiotic processing,
leading to an opening of the ring structure. Researchers have used a number of techniques to
detect post-hydroxylation metabolites in order to elucidate the dioxane catabolic pathway. In the
dioxane-co-metabolizing fungus Cordyceps sinensis, deuterated dioxane-d8 and GC-MS analysis
were used to identify ethylene glycol, glycolic acid and oxalic acid as metabolites (101). In
Mycobacterium sp. PH-06, which grows using dioxane, deuterated dioxane and GC-MS analysis
detected only two metabolites, 1,4-dioxane-2-ol and ethylene glycol (66). Meanwhile,
2-hydroxyethoxyacetic acid (2HEAA) was the sole metabolite detected with 14C-labeled dioxane in
the co-metabolizing bacterium Pseudonocardia strain ENV478 (161).

A more comprehensive analysis of dioxane metabolites was recently performed with the
dioxane-metabolizing Pseudonocardia dioxanivorans strain CB1190 (designated strain CB1190)
(91). By analyzing metabolites with Fourier transform ion cyclotron resonance-mass
spectrometry and MS-MS, and by tracking the distribution of 14C-labeled dioxane metabolites, the
previously identified metabolites were confirmed, and 2-hydroxyethoxyacetaldehyde,
1,2-dihydroxyethoxyacetic acid, 2-hydroxyethoxy-2-hydroxyacetic acid, glycoaldehyde, glyoxylic
acid and CO2 were also identified as dioxane metabolites (91). Based on these findings, a
dioxane biodegradation pathway was proposed (Fig. 5.1, black arrows), whereby all metabolites
feed through oxalic acid into central metabolism (91).

These studies have improved our understanding of the fate of dioxane during microbial
biodegradation, but there is a knowledge gap in terms of the biochemical and genetic basis for
this pathway. While monooxygenase enzymes have been implicated in the initial activation of
the dioxane ring, the further steps in dioxane metabolism have not been explored. In particular,
the mechanism for incorporation of dioxane metabolites into biomass in microorganisms that can
grow with dioxane as the sole carbon and energy source is unknown. In this work we used the
recently determined genome of P. dioxanivorans strain CB1190 (132) to investigate how
dioxane is assimilated into biomass to support growth. We present evidence highlighting the
unique role of glyoxylate in bacterial dioxane metabolism.

Materials  and  Methods 

Culture  growth 

Strain CB1190 was cultivated in ammonium mineral salts (AMS) medium (112).
Replicate bottles were amended with the appropriate carbon source (dioxane, glycolate, or
pyruvate), inoculated with a 1/500 dilution of dioxane-grown strain CB1190 culture, and
incubated at 30°C with shaking. Sterile (uninoculated) controls were prepared in parallel.
Glycolate was chosen since, of the known dioxane degradation
intermediates, it best supported strain CB1190 growth.

Cell  harvesting  and  RNA  extraction  for  transcription  studies 

Cells  were  harvested  when  approximately  half  of  the  added  substrate  had  been  consumed. 
Cells  from  replicate  bottles  were  collected  by  filtration  onto  triplicate  filters,  were  scraped  from 
the  filters  with  a  sterile  scalpel,  and  were  transferred  to  2  mL  screw-top  microcentrifuge  tubes 
containing  zirconia-silica  beads  (Biospec  Products,  Bartlesville,  OK).  Harvested  cells  were 
stored  at  -80°C  until  RNA  extraction. 

Nucleic acids were extracted using a previously published phenol method (60). To obtain
RNA, resuspended nucleic acids were initially separated with the Allprep kit (Qiagen), and then
the RNA was purified using the RNeasy kit (Qiagen). Following elution with RNase-free water,
any contaminating DNA was removed by two successive DNase I treatments using the DNA-free
kit (Ambion, Austin, TX) according to the manufacturer's instructions. Pure RNA was
obtained with a final cleanup with the RNeasy kit.

Figure 5.1. Proposed pathways and enzymes involved in dioxane metabolism in strain CB1190.
Black arrows indicate the transformations previously proposed by Mahendra et al. (91). Grey arrows
indicate transformations supported by in silico or experimental results in the current work. Dashed arrows
indicate multi-step transformations.

Microarray  design  and  analyses 

A draft version of the strain CB1190 genome was obtained from the Joint Genome
Institute and consisted of 6896 protein-encoding sequences in 226 contigs. These were submitted
to Affymetrix (Santa Clara, CA) for custom chip design. The final design consisted of a set of
perfect match-only 25-mer probes, with each sequence targeted by 11-13 probes, and also
included a standard set of Affymetrix controls for prokaryotic gene expression microarrays.
Subsequent to microarray production, a finished genome sequence for strain CB1190 was made
available (132). The microarray probes were re-annotated to reflect the finished genome
sequence. Re-annotated microarray probe sets targeted 6391 (approximately 94%) of the coding
sequences in the finished genome.

Five µg of total RNA was used as starting material for each microarray analysis.
Triplicate microarrays were processed for each substrate treatment. cDNA was synthesized,
fragmented, labeled, and hybridized to arrays according to the protocols outlined in section 3 of
the Affymetrix GeneChip Expression Analysis technical manual, with the following changes:
cDNA fragmentation was achieved by the addition of 0.07 U DNase I/µg cDNA, and the
hybridization temperature was 52°C. Hybridized arrays were stained and washed according to
Affymetrix protocol “Modified FlexMidi_euk2v3 for P. aeruginosa Array” and were scanned
using an Affymetrix GeneChip Scanner 3000.

Microarray data were first processed with the RMAExpress software program (15) to
compute gene expression summary values. Background adjustment and quantile normalization
were performed across all microarrays. The average normalized expression ratio (dioxane or
glycolate treatment/pyruvate control) was calculated for each gene. Ratios were considered
significant if they were >2.0 or <0.5 and Student’s t-test indicated a p-value (two-tailed) of <0.05.
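
The two-part significance criterion can be sketched as follows. This is an illustrative reading of the rule, not the authors' analysis script. It assumes linear-scale normalized expression values, an equal-variance two-sample Student's t-test, and triplicate arrays per treatment, so that there are 4 degrees of freedom and the two-tailed p < 0.05 cutoff corresponds to |t| > 2.776. The function names and example values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

# Two-tailed critical t value for p = 0.05 with 4 degrees of freedom (3 + 3 replicates).
T_CRIT_DF4 = 2.776


def pooled_t(a, b):
    """Student's two-sample t statistic assuming equal variances."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))


def is_differentially_expressed(treatment, control, t_crit=T_CRIT_DF4):
    """Apply both filters: ratio > 2.0 or < 0.5, and |t| above the critical value."""
    ratio = mean(treatment) / mean(control)
    if 0.5 <= ratio <= 2.0:
        return False
    return abs(pooled_t(treatment, control)) > t_crit


# Hypothetical expression values for one gene (treatment vs. pyruvate control).
print(is_differentially_expressed([800, 900, 1000], [100, 110, 90]))
```

Requiring both filters guards against two failure modes: large ratios driven by one noisy replicate, and statistically consistent but biologically small changes.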

Quantitative  RT-PCR 

To verify microarray data, the relative transcription of 17 genes in the dioxane and
glycolate treatments was compared with the pyruvate treatment. cDNA was synthesized from
500 ng of RNA using the TaqMan reverse transcription kit (Applied Biosystems, Foster City,
CA) with random hexamers in 15 µL reactions according to the manufacturer’s protocol. cDNA
was diluted 100-fold and 2 µL were used in each subsequent qPCR reaction. All qPCR reactions
were performed in triplicate with an Applied Biosystems StepOne Plus real-time PCR system.
For most genes, SYBR Green chemistry was used, and each 20 µL reaction consisted of 1× Fast
SYBR Green master mix, 2 µL of diluted cDNA and each primer at 0.5 µM. The PCR cycling
conditions were: 95°C for 20 s, then 40 cycles of (95°C for 3 s, annealing for 30 s), followed by
melting curve analysis. The genes tpi, thiC, and rpoD were determined by the geNorm method
(163) to be stably transcribed across all treatments, and were thus used as internal references.
TaqMan chemistry was used for these genes, and each qPCR reaction consisted of 1× Fast
Universal Mix (Applied Biosystems), 2 µL of diluted cDNA, each primer at 0.5 µM, and the
probe at 145 nM. The cycling conditions were: 95°C for 20 s, then 40 cycles of (95°C for 1 s,
annealing for 20 s). The primer sequences and dye and quencher chemistry for each probe are
listed in Table 5.1. The efficiency of each qPCR assay was determined with a serial dilution of
cDNA derived from a dioxane treatment replicate. Gene expression was normalized using the
method of Vandesompele et al. (163).
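
The efficiency determination and reference-gene normalization described above can be illustrated with a short sketch. It assumes the common standard-curve relation E = 10^(-1/slope) for the per-cycle amplification factor and, following the general idea of the Vandesompele et al. approach, divides the target quantity by the geometric mean of the reference-gene quantities. All function names and numbers are hypothetical; this is not the authors' calculation.

```python
from math import exp, log


def slope(xs, ys):
    """Least-squares slope of ys regressed on xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


def amplification_efficiency(log10_dilutions, cq_values):
    """Per-cycle amplification factor E from a dilution series (E = 2 is perfect doubling)."""
    return 10 ** (-1.0 / slope(log10_dilutions, cq_values))


def relative_quantity(efficiency, cq_sample, cq_calibrator):
    """Quantity in a sample relative to a calibrator: E ** (Cq_calibrator - Cq_sample)."""
    return efficiency ** (cq_calibrator - cq_sample)


def geometric_mean(values):
    return exp(sum(log(v) for v in values) / len(values))


def normalized_expression(target_rq, reference_rqs):
    """Divide the target quantity by the geometric mean of the reference-gene quantities."""
    return target_rq / geometric_mean(reference_rqs)


# Hypothetical 10-fold dilution series: Cq rises by about 3.32 cycles per dilution step.
eff = amplification_efficiency([0, -1, -2, -3], [20.0, 23.32, 26.64, 29.96])
target = relative_quantity(eff, cq_sample=20.0, cq_calibrator=23.0)
print(round(eff, 2), round(normalized_expression(target, [1.0, 1.0, 1.0]), 1))
```

Using the geometric rather than arithmetic mean keeps a single highly expressed reference gene from dominating the normalization factor.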

Table 5.1. SYBR Green and TaqMan primers used in qRT-PCR analyses.a,b

| Target gene ID | Gene product | Sense primer sequence | Antisense primer sequence |
|---|---|---|---|
| Psed_0629 | Methane monooxygenase | GAACCCGGAGCTGCACAA | TTCAGGTTCTGCTGGATCGT |
| Psed_0768 | Phenol 2-monooxygenase | GCTCTTCGTGCCCTTCATGA | GAGAACCCGAACGACATCGT |
| Psed_0815 | Methane/phenol/toluene hydroxylase | GGTGGAAGGACCGTGACAAC | GGCCAGGAAGCGATCGAT |
| Psed_3051 | RNA polymerase sigma factor RpoD | TGATCCAGGAGGGCAACCT | TAGCCCTTGGTGTAGTCGAACTT |
| Psed_3417 | Triosephosphate isomerase | CCGGCAACTGGAAGATGAAC | GGCGAGCTTCTGCAGGAA |
| Psed_3888 | Hydroxypyruvate isomerase | CTTCCTGTTCGACCTGTTCCA | TGCACGTGGGCGATCAC |
| Psed_3889 | Tartronate semialdehyde reductase | ACATGCTGAACCGGGACTTC | CTTCATGTCCTTGTGGTGCAA |
| Psed_3890 | Glyoxylate carboligase | GCGCACCACGCACTACGACC | CGGCCGAAGACCCGGTTCATC |
| Psed_3891 | Glycerate kinase | GATCGAGCAGGCCTACGC | TGCGAGCTCCTCCAGCAG |
| Psed_4782 | Malate synthase | CGAGCAGACGGTCCGGGAGA | GTCGCTCATCGGCCGGTAGC |
| Psed_4783 | Glycolate oxidase | GGCCGCATCTCCCCCGACTA | CTCGCCCATGTCCGCGATCC |
| Psed_4788 | Glycolate oxidase | CCGACCTCGCGACGATGACG | GAGGGGTGGGCAGCAGCTTG |
| Psed_6168 | Thiamine biosynthesis protein C | TGAAGGTCAACGCCAACATC | TGGCCCACACCAGTTTCTC |
| Psed_6782 | Alcohol dehydrogenase | GCAGGTCGAGTCAAGGACTTTC | GCCGTGCGGCACCAT |
| Psed_6783 | Fatty-acid-CoA ligase | CTCTAGTGGACGAGGAGGAACTG | TCGCTTAGGCACCTTGTAACC |
| Psed_6970 | FAD/FMN-dependent dehydrogenase | GAGTTGACTTCGATCTGCTTACGA | GGCCGCATCAACCATGAA |
| Psed_6971 | Alcohol dehydrogenase | TTGCTGGGATGGTGAAGGA | CCATGAGGCACCATTGCTT |
| Psed_6975 | Aldehyde dehydrogenase | TGACATCGAGCAGGCTGTTG | CAGGTCTGCCCGTTGTTGT |
| Psed_6976 | Monooxygenase oxidoreductase domain | GGCTGGGACCGGACGTA | TTGTTCGGCAGCTTGTATCG |
| Psed_6981 | Aldehyde dehydrogenase | GGAGCGCCGGTTATTATCAA | TGACCCAAGAACAGCGATGA |

Probe sequence

(TET)CTGATCCGTGCGGTC(TAMRA)
(VIC)CACCTCGAGGCCATC(TAMRA)
(FAM)CGTCGATCGAGGACG(TAMRA)
(TET)TACCCCCATGAGCGCGGGAA(TAMRA)

a All primer sets were annealed at 60°C, except for Psed_4783, which was annealed at 65°C.
b Except for where probe sequences are listed, all PCR reactions were performed with SYBR Green chemistry.

Cloning and expression of strain CB1190 genes

Strain CB1190 genes Psed_3889 and Psed_3890 were PCR amplified and ligated into
linearized Rhodococcus expression plasmid pTip-QC2 (102) using the hetero-stagger cloning
method (85). Rhodococcus jostii strain RHA1 was transformed with pTip-Psed_3889,
pTip-Psed_3890, or pTip-QC2, and then expression of the inserted gene was induced by the addition
of thiostrepton to the culture medium. Cell-free extracts from strain RHA1/pTip cells were
prepared as described below for strain CB1190 cells.

Glyoxylate  carboligase  assay 

Strain CB1190 cells grown with either pyruvate or dioxane were harvested by filtration
and scraped cells were stored at -80°C. Cell-free extracts were prepared by resuspending the
cells in sodium phosphate buffer (10 mM, pH 7.0), adding zirconia-silica beads, and bead
beating. Cell debris was collected by centrifugation, and the supernatant (cell-free extract) was
aliquoted and stored at -80°C until use. Protein was determined by the method of Bradford (17).

Glyoxylate carboligase activity was tested using the linked glyoxylate
carboligase/tartronate semialdehyde reductase assay described by Cusa et al. (28). Briefly, 2 mL
reactions, containing 100 mM sodium phosphate (pH 8.0), 5 mM MgCl2, 0.28 mM NADH, 0.5
mM thiamine pyrophosphate, and 200 µg of protein from cell-free extracts, were prepared in
triplicate in screw-top glass vials with Mininert caps. The headspace was flushed with N2, glyoxylate
was added to 10 mM, and vials were incubated with shaking at 30°C for 16 h. Reactions were
stopped and CO2 liberated by the addition of 10 N sulfuric acid and shaking for 15 min. CO2 was
analyzed by gas chromatography-pulsed discharge detection (GC-PDD) and glyoxylate was
analyzed by HPLC.

For assays with cell-free extracts from strain RHA1 carrying pTip plasmid constructs, the
assay vials were prepared as above for the following treatments: 1) pTip-Psed_3889 (8 µg of
protein); 2) pTip-Psed_3890 (7 µg of protein); 3) pTip-Psed_3889 + pTip-Psed_3890 (8 µg and
7 µg of protein of each, respectively); 4) pTip-QC2 (10 µg of protein); 5) Buffer control. CO2
production was determined by GC-PDD, and glyoxylate consumption and glycerate production
were determined by HPLC.

Growth with 1,4-[U-13C]dioxane

A series of strain CB1190 cultures was prepared with 1,4-[U-13C]dioxane (99% pure;
Sigma-Aldrich). Serum bottles with black butyl rubber stoppers and containing AMS medium
were amended with 13C-dioxane, or 13C-dioxane plus 3% CO2. These bottles were inoculated
with a 1/300 dilution of 13C-dioxane-grown strain CB1190 culture and were incubated at 30°C with
shaking. Following complete removal of the dioxane, cells from each treatment were transferred
to fresh triplicate bottles with the same amendments and were incubated with shaking until
dioxane removal was complete. Cells were then harvested by filtration, scraped from the filter,
and stored at -80°C until isotopomer amino acid analysis.

Analytical  methods 

Pyruvate, glycolate and glyoxylate consumption and glycerate production were monitored
with a Waters (Milford, MA) model 2695 HPLC equipped with a model 2996 photodiode array
detector and an Aminex HPX-87H column (Bio-Rad, Hercules, CA). Dioxane consumption was
monitored by direct injection of samples onto a Varian 3400 GC equipped with an FID detector
and a 1% AT-1000 on Carbograph 1 packed column (Grace, Columbia, MD). CO2 production in
cell-free extracts was determined by injecting headspace samples onto an HP 6890 GC equipped
with a PDD detector, a 1.5 m Hayesep-DB 100/120 pre-column and a 2 m Hayesep-DB 120/140
column.

The preparation and isotopomeric analysis of proteinogenic amino acids were performed as
previously described. Two types of charged fragments were clearly detected by GC-MS for the
derivatized amino acids: the [M-57]+, which contains the entire amino acid; and the [M-159]+,
which contains the amino acid without the first carbon (a carboxyl group). The [M-57]+ peaks
in leucine, isoleucine and proline overlap with other peaks, so the [M-159]+ group was used to
obtain the isotopomer labeling information of those amino acids. The final isotopomer labeling
fractions were indicated as: M0 (unlabeled fraction), M1 (single labeled carbon fraction), M2
(fraction with two labeled carbons), M3 (fraction with three labeled carbons), and so forth.
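
The M0, M1, M2, ... notation can be made concrete with a small sketch. It assumes the ion intensities have already been corrected for natural isotope abundance and for atoms contributed by the derivatization reagent, which this code does not do. The function names and intensities are hypothetical.

```python
def labeling_fractions(intensities):
    """Normalize ion intensities for M0, M1, ..., Mn so that the fractions sum to 1."""
    total = sum(intensities)
    if total <= 0:
        raise ValueError("intensities must sum to a positive value")
    return [value / total for value in intensities]


def mean_enrichment(fractions):
    """Average fraction of labeled carbons: sum(i * Mi) / n for a fragment with n carbons."""
    n_carbons = len(fractions) - 1
    if n_carbons < 1:
        raise ValueError("need at least M0 and M1")
    return sum(i * f for i, f in enumerate(fractions)) / n_carbons


# Hypothetical intensities for a three-carbon fragment (M0 to M3).
fractions = labeling_fractions([10, 20, 30, 40])
print(fractions, round(mean_enrichment(fractions), 3))
```

A fragment from cells grown only on fully labeled dioxane would be expected to sit mostly in the highest mass fraction, while incorporation of unlabeled CO2 would shift weight toward lower fractions.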

Results  and  Discussion 

In  silico  analysis  of  dioxane  degradation  pathway  genes 

The pathway for dioxane degradation that was proposed by Mahendra et al. (91) (Fig. 5.1,
black  arrows)  was  the  starting  point  for  this  study.  We  searched  the  literature  for  descriptions  of 
enzymatic  activities  catalyzing  the  proposed  dioxane  metabolic  transformations,  and  used  the 
amino  acid  sequence  of  characterized  enzymes  to  identify  homologues  in  the  strain  CB1190 
genome  sequence  using  blastp  (4). 

Dioxane degradation is initiated by the activity of a monooxygenase (89, 161). The strain
CB1190 genome carries eight bacterial multi-component monooxygenase gene clusters, seven of
which are encoded on the chromosome and one that is on plasmid pPSED02 (132).

The transformation of the initial hydroxylated dioxane metabolite is proposed to involve
spontaneous ether cleavage and/or aldehyde/carboxylic acid and/or secondary alcohol/ketone
conversions (91). Strain CB1190 has 13 genes encoding proteins with at least 30% amino acid
identity with characterized bacterial aldehyde dehydrogenases (E. coli AldA [accession
NP_415933], E. coli AldB [accession AAC76612], Rhodococcus erythropolis AldH [accession
AAZ14956]), which catalyze the NAD(P)+-dependent oxidation of aldehydes to carboxylic acids.
Eleven of these genes are located on the chromosome, while the remaining two are on plasmid
pPSED02. Three genes encoding proteins with 29-36% amino acid identity to a secondary
alcohol dehydrogenase (accession CAD36475) of the actinomycete Rhodococcus ruber are on
the chromosome; this enzyme catalyzes the NAD+-dependent transformation of secondary
alcohols to ketones (89, 161).

The second proposed hydroxylation reaction, which generates the single-ether-containing
2-hydroxyethoxy-2-hydroxyacetic acid or 1,2-dihydroxyethoxyacetic acid, has also been postulated
to be catalyzed by a monooxygenase (91) (see Fig. 5.1).

The subsequent cleavage of the second ether bond leads to the production of two-carbon
intermediates, of which glyoxal, ethylene glycol, glycoaldehyde, glycolate, glyoxylate and
oxalate have been identified (91). The strain CB1190 genome has three genes encoding proteins
with 27-34% amino acid identity to E. coli 1,2-propanediol oxidoreductase (accession
AP_003365), which transforms ethylene glycol to glycoaldehyde (26). Glycoaldehyde is
transformed to glycolate by aldehyde dehydrogenases (see above), while glycolate is converted
to glyoxylate by glycolate oxidase (70). In E. coli, glycolate oxidase is encoded by glcDEF
(116); strain CB1190 has two adjacent homologues, glcD1E1F1 and glcD2E2F2 (Fig. 5.2).
These homologues share 72%, 47%, and 64% amino acid identity for GlcD, GlcE and GlcF,
respectively.

Mahendra et al. (91) proposed that glyoxal is transformed to glyoxylate, and although this
enzymatic activity has not been previously described in the literature, the numerous aldehyde
dehydrogenases encoded in the strain CB1190 genome could potentially catalyze this reaction.
Alternatively, genes encoding aldehyde reductase homologues (E. coli YqhD [accession
Q46856], Bacillus subtilis YvgN [accession O32210]) that catalyze the transformation of glyoxal
to glycoaldehyde were identified. Furthermore, in E. coli the glyoxalase I/II system, encoded by
the genes gloA and gloB, uses S-glutathione to transform glyoxal to glycolate (155); however, no
homologous genes were identified in the strain CB1190 genome.

Since glyoxylate has been identified as a metabolite of dioxane (91, 101), we examined the
strain CB1190 genome for genes involved in known glyoxylate assimilation pathways. Strain
CB1190 has a cluster of genes (Fig. 5.2) homologous to those encoding the glyoxylate
carboligase pathway that converts glyoxylate through tartronate semialdehyde and glycerate to
phosphoglycerate, in order to provide both carbon and energy for growth in E. coli (73). Strain
CB1190 also has a gene (Psed_4782) that encodes malate synthase G, which in E. coli (and
in strain CB1190; see Fig. 5.2) is linked to the locus encoding glycolate oxidase and which
assimilates glyoxylate directly into the TCA cycle via malate (71, 99). Malate synthase G is
non-essential for the growth of E. coli on glyoxylate, and is therefore considered to play an
anaplerotic role (110).

While the transformation of glyoxylate to oxalate has been reported in Pseudomonas
fluorescens (140), the gene sequence for the enzyme catalyzing this reaction has not yet been
determined in bacteria. Finally, aerobic bacterial growth on oxalate was shown to be dependent
on the oxalate decarboxylase OxdC, which converts oxalate to formate (150). However, the
strain CB1190 genome lacks a homologue for this enzyme.

Gene  expression  during  growth  on  dioxane,  glycolate  and  pyruvate 

We used gene expression microarrays targeting the strain CB1190 genome to determine
the genes that are differentially regulated during growth on dioxane or glycolate relative to
growth on pyruvate. The expression of 383 genes differed significantly; 97 genes were
up-regulated with dioxane relative to the pyruvate control, whereas 286 genes were down-regulated.
When strain CB1190 was grown with glycolate, the expression of 506 genes differed
significantly relative to pyruvate-grown cells, with 203 genes up-regulated and 303
down-regulated. A comparison of genes differentially regulated with either dioxane or glycolate
relative to pyruvate identified 36 genes that were up-regulated with both substrates, whereas 121
genes were down-regulated with both substrates. Table 5.2 provides a list of genes that were
up-regulated with dioxane alone or with both dioxane and glycolate.
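
The gene counts in this paragraph can be cross-checked with simple set arithmetic. The short Python sketch below uses only the numbers reported above; the variable names are illustrative and not taken from the original analysis.

```python
# Counts reported in the text (relative to the pyruvate control).
dioxane = {"up": 97, "down": 286, "total": 383}
glycolate = {"up": 203, "down": 303, "total": 506}
shared_up = 36     # up-regulated with both substrates
shared_down = 121  # down-regulated with both substrates

# Each total should equal the sum of its up- and down-regulated genes.
assert dioxane["up"] + dioxane["down"] == dioxane["total"]
assert glycolate["up"] + glycolate["down"] == glycolate["total"]

# Genes responding to one substrate only.
dioxane_only_up = dioxane["up"] - shared_up          # 61
glycolate_only_up = glycolate["up"] - shared_up      # 167
dioxane_only_down = dioxane["down"] - shared_down    # 165
glycolate_only_down = glycolate["down"] - shared_down  # 182

print(dioxane_only_up, glycolate_only_up, dioxane_only_down, glycolate_only_down)
```

The 61 genes obtained for the dioxane-only, up-regulated set is the group most likely to contain functions specific to the early steps of dioxane transformation.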

The replicon location of the dioxane- and glycolate-induced genes was examined. For
dioxane-grown cells, 27 and 17 genes were induced from plasmids pPSED01 and pPSED02,
respectively, with the remaining induced genes on the chromosome. For glycolate-grown cells,
15 induced genes were from plasmid pPSED01, while the rest were on the chromosome; no
genes were induced from plasmid pPSED02.

Sixty-one genes were up-regulated with dioxane but not with glycolate, relative to
pyruvate (Table 5.2). Of these genes, 21 were from plasmid pPSED01, while 17 genes were from
plasmid pPSED02. Of the dioxane-induced plasmid pPSED01 genes, 14 encode
hypothetical proteins.

Table 5.2. Strain CB1190 genes on microarray up-regulated during growth with dioxane

                                                                               Expression ratio (c)
Gene ID    Gene   Gene product                                       Replicon  Dioxane/   Glycolate/
           name                                                      (b)       Pyruvate   Pyruvate

Psed_0038         Regulatory protein ArsR                            c         2.0        0.37
Psed_0350         Bile acid:sodium symporter                         c         2.9        13
Psed_1076         Short-chain dehydrogenase/reductase SDR            c         2.6        1.7
Psed_1302         Protein of unknown function DUF156                 c         3.2        11
Psed_1303         Heavy metal transport/detoxification protein       c         2.7        8.0
Psed_1304         Heavy metal translocating P-type ATPase            c         2.5        9.7
Psed_1584         6-Phosphofructokinase                              c         3.3        3.9
Psed_1594         Protein of unknown function UPF0016                c         2.3        1.2
Psed_1658         Trimethylamine-N-oxide reductase (cytochrome c)    c         5.3        0.63
Psed_2030         Linalool 8-monooxygenase                           c         2.2        2.4
Psed_2371         ABC-type transporter, periplasmic subunit          c         2.0        3.1
Psed_3041         NADH dehydrogenase (ubiquinone) 24 kDa subunit     c         2.0        1.1
Psed_3522         Response regulator receiver                        c         3.5        3.9
Psed_3564         Aromatic-amino-acid transaminase                   c         3.0        1.2
Psed_3888  hyi    Hydroxypyruvate isomerase                          c         106        112
Psed_3889  glxR   Tartronate semialdehyde reductase                  c         114        114
Psed_3890  gcl    Glyoxylate carboligase                             c         98         108
Psed_3891  glxK   Glycerate kinase                                   c         49         56
Psed_3934         Major facilitator superfamily MFS_1                c         2.4        1.6
Psed_3935         GntR domain protein                                c         2.5        1.1
Psed_4031         Iron-sulfur cluster binding protein                c         2.1        2.8
Psed_4146         Potassium-transporting ATPase B chain              c         2.6        1.2
Psed_4147         Potassium-transporting ATPase A chain              c         2.6        1.2
Psed_4148         K+-transporting ATPase, F subunit                  c         3.0        1.3
Psed_4512         ABC-type transporter, integral membrane subunit    c         2.2        2.7
Psed_4513         Cobalamin (vitamin B12) biosynthesis CbiX protein  c         2.1        2.4
Psed_4577         Cold-shock protein DNA-binding                     c         2.2        3.7
Psed_4755         Ammonium transporter                               c         2.2        9.3
Psed_4782  glcB   Malate synthase                                    c         10         11
Psed_4788  glcD2  D-lactate dehydrogenase (cytochrome)               c         43         44
Psed_4789  glcE2  FAD linked oxidase domain protein                  c         24         24
Psed_4790  glcF2  Protein of unknown function DUF224 cysteine-rich   c         27         26
                  region domain protein
Psed_5025         4-Hydroxyacetophenone monooxygenase                c         5.8        3.6
Psed_5026         Regulatory protein TetR                            c         2.4        2.8
Psed_5135         Malic protein NAD-binding                          c         2.6        1.4
Psed_5406         NAD(P)(+) transhydrogenase (AB-specific)           c         2.1        2.2
Psed_5524         Hydrolase                                          c         2.1        1.3
Psed_5736         Transglycosylase-like domain protein               c         2.8        8.4
Psed_5938         Glycoside hydrolase family 13 domain-containing    c         2.3        1.5
                  protein
Psed_6175         Regulatory protein TetR                            c         3.5        4.0
Psed_6259         FMN-dependent oxidoreductase, nitrilotriacetate    c         2.5        1.2
                  monooxygenase family
Psed_6261         ABC-type transporter, integral membrane subunit    c         2.3        1.1
Psed_6262         ABC-type transporter, integral membrane subunit    c         2.2        0.97
Psed_6263         Extracellular ligand-binding receptor              c         2.2        0.92
Psed_6600         Acyl-CoA dehydrogenase domain-containing protein   c         4.2        0.95
Psed_6728         Amidase                                            p1        2.6        0.99
Psed_6730         Luciferase-like, subgroup                          p1        2.8        0.72
Psed_6732         MaoC domain protein dehydratase                    p1        2.2        1.0
Psed_6735         Flavoprotein WrbA                                  p1        2.1        1.1
Psed_6742         NLP/P60 protein                                    p1        2.7        1.1
Psed_6745         ATP-binding protein                                p1        3.7        0.72
Psed_6751         Transglycosylase-like domain protein               p1        5.0        0.60
Psed_6779         Integrase catalytic region                         p1        6.9        5.4
Psed_6782         Hydroxyacid-oxoacid transhydrogenase               p1        4.6        4.3
Psed_6784         Alkylglycerone-phosphate synthase                  p1        3.5        3.5
Psed_6787         Formyl-CoA transferase                             p1        13         11.3
Psed_6791         IstB domain protein ATP-binding protein            p1        3.5        2.4
Psed_6799         Transposase IS4 family protein                     p1        2.0        1.7
Psed_6970         D-lactate dehydrogenase (cytochrome)               p2        6.3        0.67
Psed_6971         Hydroxyacid-oxoacid transhydrogenase               p2        5.1        0.69
Psed_6972         GntR domain protein                                p2        3.4        0.62
Psed_6974         Ethyl tert-butyl ether degradation EthD            p2        3.2        0.71
Psed_6975         Betaine-aldehyde dehydrogenase                     p2        2.6        0.74
Psed_6977  thmD   Ferredoxin--NAD(+) reductase                       p2        4.2        0.72
Psed_6978  thmB   Methane/phenol/toluene hydroxylase                 p2        2.7        0.80
Psed_6979  thmC   Monooxygenase component MmoB/DmpM                  p2        3.7        0.76
Psed_6981         Aldehyde dehydrogenase                             p2        9.5        0.83
Psed_6982         Mn2+/Fe2+ transporter, NRAMP family                p2        14         1.0
Psed_7002         Transcription factor WhiB                          p2        2.7        0.67

(a) Only annotated genes with predicted functions are shown. The following genes (Psed_XXXX) were
automatically annotated as “hypothetical protein”: (chromosome) 2752, 3653, 3669, 4149, 4306, 4424,
5371; (plasmid pPSED01) 6729, 6743, 6744, 6746, 6747, 6748, 6749, 6750, 6752, 6753, 6754, 6755,
6757, 6758, 6780; (plasmid pPSED02) 6913, 6973, 6980, 7003, 7007, 7008
(b) c = chromosome; p1 = plasmid pPSED01; p2 = plasmid pPSED02
(c) Genes up-regulated with dioxane but not with glycolate are indicated in bold
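
To make the grouping in Table 5.2 concrete, the following Python sketch sorts a few of the tabulated genes into "both" or "dioxane only" classes. The two-fold cutoff is an assumption made for illustration (the significance criteria actually used are not restated here), and the function and variable names are hypothetical.

```python
# Assumed cutoff for calling a gene up-regulated; not taken from the text.
THRESHOLD = 2.0

# (gene ID, dioxane/pyruvate ratio, glycolate/pyruvate ratio) from Table 5.2.
rows = [
    ("Psed_3890", 98.0, 108.0),
    ("Psed_4788", 43.0, 44.0),
    ("Psed_1076", 2.6, 1.7),
    ("Psed_6975", 2.6, 0.74),
    ("Psed_6981", 9.5, 0.83),
]


def classify(dioxane_ratio, glycolate_ratio, threshold=THRESHOLD):
    """Label a gene by which substrate(s) raise its expression above the cutoff."""
    up_dioxane = dioxane_ratio >= threshold
    up_glycolate = glycolate_ratio >= threshold
    if up_dioxane and up_glycolate:
        return "both"
    if up_dioxane:
        return "dioxane only"
    if up_glycolate:
        return "glycolate only"
    return "neither"


labels = {gene: classify(d, g) for gene, d, g in rows}
for gene, label in labels.items():
    print(gene, label)
```

Under this assumed cutoff, the glyoxylate carboligase and glycolate oxidase genes fall in the "both" class, while the plasmid pPSED02 aldehyde dehydrogenases fall in the "dioxane only" class.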

Upregulation  of  a  multi-component  monooxygenase  gene  cluster  during  dioxane 
metabolism 

While the strain CB1190 genome encodes eight monooxygenase gene clusters,
microarray analysis of dioxane-induced transcription revealed that only the plasmid
pPSED02-encoded monooxygenase cluster was up-regulated when compared with growth with pyruvate
(Table 5.2). Upregulation of this cluster was verified with qRT-PCR using primers targeting
Psed_6976, encoding a homologue of the THF monooxygenase α-subunit (Table 5.3). Three
chromosomally-encoded monooxygenase gene clusters were missing from the microarray, so
their potential involvement in dioxane metabolism was tested using qRT-PCR with primers
targeting the genes encoding the α-subunits. As Table 5.3 shows, no significant differential
transcription was observed for these three monooxygenase genes (Psed_0629, Psed_0768, and
Psed_0815) in cDNA from dioxane-, glycolate- or pyruvate-grown cells.

The dioxane-induced, plasmid pPSED02-encoded monooxygenase gene cluster is
homologous to the thm gene cluster involved in THF utilization in Pseudonocardia
tetrahydrofuranoxydans strain K1 (154). This gene cluster has also been identified in DNA
extracted from Pseudonocardia strain ENV478 (93), and in both strains K1 and ENV478,
growth with THF induces transcription of genes in this cluster (93, 154). Although strains K1
and ENV478 degrade dioxane co-metabolically when induced with THF (89, 161), the published
results did not directly link the THF-induced monooxygenase gene cluster to biochemical
dioxane transformation activity. However, the up-regulation of the homologous thm
monooxygenase gene cluster in strain CB1190 during dioxane metabolism reported here
supports the involvement of the thm monooxygenase gene clusters in dioxane degradation in
strains K1 and ENV478.

Table 5.3. Normalized relative expression levels for selected strain CB1190 genes analyzed by
qRT-PCR.

                                                                Expression ratio (b)
Gene ID    Gene   Gene product                        Replicon  Dioxane/     Glycolate/
           name                                       (a)       Pyruvate     Pyruvate

Monooxygenase genes lacking probes on microarray
Psed_6976  thmA   THF monooxygenase α-subunit         p2        15 ± 0.2     1.0 ± 0.5
Psed_0629  prmA   Methane monooxygenase               c         0.4 ± 0.1    0.8 ± 0.5
Psed_0768         Phenol 2-monooxygenase              c         2 ± 2        0.9 ± 0.4
Psed_0815         Methane/phenol/toluene hydroxylase  c         1.1 ± 0.5    0.3 ± 0.4

Dioxane metabolism intermediates-related genes
Psed_6782         Alcohol dehydrogenase               p1        7.3 ± 0.8    3.3 ± 0.4
Psed_6783         Fatty-acid-CoA ligase               p1        24 ± 2       7 ± 5
Psed_6970         FAD/FMN-dependent dehydrogenase     p2        6.1 ± 0.8    1.5 ± 0.9
Psed_6971         Alcohol dehydrogenase               p2        8.7 ± 0.8    1.3 ± 0.4
Psed_6975         Aldehyde dehydrogenase              p2        23 ± 2       0.9 ± 0.3
Psed_6981         Aldehyde dehydrogenase              p2        11 ± 1       1.5 ± 0.9

Glycolate and glyoxylate metabolism-related genes
Psed_4783  glcD1  Glycolate oxidase                   c         0.3 ± 0.1    0.7 ± 0.5
Psed_4788  glcD2  Glycolate oxidase                   c         3.6 ± 0.3    2.8 ± 0.5
Psed_4782  glcB   Malate synthase G                   c         5.4 ± 0.2    2 ± 1
Psed_3888  hyi    Hydroxypyruvate isomerase           c         5 ± 1        3.1 ± 0.3
Psed_3889  glxR   Tartronate semialdehyde reductase   c         13 ± 1       6 ± 2
Psed_3890  gcl    Glyoxylate carboligase              c         18 ± 2       10 ± 4
Psed_3891  glxK   Glycerate kinase                    c         1.9 ± 0.6    1.9 ± 0.7

(a) c = chromosome; p1 = plasmid pPSED01; p2 = plasmid pPSED02
(b) Genes up-regulated with dioxane but not with glycolate are indicated in bold

Genes potentially contributing to transformation of C4 dioxane metabolites

After the initial monooxygenase-catalyzed hydroxylation, the C4 dioxane metabolites
undergo a series of transformations that ultimately yield two C2 compounds (Fig. 5.1). The set of
genes on plasmid pPSED02 that are induced by dioxane but not by glycolate could potentially be
involved in these transformations (Table 5.2). The operon containing the thm
monooxygenase-encoding gene cluster also includes genes encoding two aldehyde dehydrogenases (Psed_6975
and Psed_6981), and upstream of the monooxygenase cluster and in the reverse orientation is an
upregulated gene cluster encoding an alcohol dehydrogenase and a FAD/FMN-dependent
dehydrogenase, as well as a transcriptional regulator (Psed_6970-6972). Dioxane-dependent
upregulation of these genes was verified by qRT-PCR (Table 5.3). The aldehyde-carboxylic acid
conversion of C4 dioxane metabolites could be catalyzed by these dioxane-induced aldehyde
dehydrogenases. In contrast, a candidate gene encoding an enzyme for the secondary
alcohol-ketone conversion of C4 metabolites (i.e., 2-hydroxy-1,4-dioxane to 1,4-dioxane-2-one) is not
immediately clear, since none of the secondary alcohol dehydrogenase genes identified by in
silico analysis were specifically up-regulated with dioxane. The alcohol dehydrogenase-encoding
Psed_6971 or constitutively expressed secondary alcohol dehydrogenases (not identified in the
differential-transcription analyses reported here) could be involved in this reaction. We are
currently cloning the up-regulated dehydrogenases and reductases to functionally test their role
in the transformation of C4 dioxane metabolites.

The dioxane metabolic pathway also requires a second hydroxylation and two ether
cleavage steps. Ether cleavage may occur spontaneously following hydroxylation, or it may
occur enzymatically (167). A heterogeneous group of ether-cleaving enzymes has been
identified (reviewed in White et al. (167)), and an ether-bond cleaving diglycolic acid
dehydrogenase has been purified from P. tetrahydrofuranoxydans strain K1 (172). However,
genomic and transcriptomic analysis of dioxane metabolism did not identify such enzymes in
strain CB1190.

Genes  potentially  contributing  to  transformation  of  C2  dioxane  metabolites  to  glyoxylate 

The plasmid pPSED02-encoded putative aldehyde and alcohol dehydrogenases that were
up-regulated during growth on dioxane but not on glycolate could also contribute to the
transformation of the dioxane metabolites glyoxal, glycoaldehyde and ethylene glycol. In E. coli
the aldehyde reductase YqhD catalyzes the NADPH-dependent reduction of glyoxal to
glycoaldehyde (81), while in Bacillus subtilis the non-homologous YvgN catalyzes the same
reaction (130). The up-regulated aldehyde dehydrogenase gene products share only low amino
acid sequence identity with these proteins, but Psed_6975 has 35% identity to E. coli AldA,
which catalyzes the NAD+-dependent oxidation of glycoaldehyde to glycolate (53). Boronat et al.
(16) identified an E. coli mutant that employed the activity of propanediol oxidoreductase (fucO
(26)) to transform ethylene glycol to glycoaldehyde, which was further transformed to glycolate.
The putative alcohol dehydrogenase Psed_6971 that was up-regulated with dioxane shares 42%
amino acid similarity with E. coli 1,2-propanediol oxidoreductase, so the gene product may
catalyze the conversion of ethylene glycol to glycoaldehyde during dioxane metabolism. It
should be noted that the product of the glycolate- and dioxane-induced alcohol dehydrogenase gene
Psed_6782 has similar amino acid identity to E. coli propanediol oxidoreductase, although its
role in glycolate metabolism is not clear.

The set of genes up-regulated with both dioxane and glycolate relative to pyruvate
included the putative glycolate oxidase-encoding chromosomal gene cluster glcD2E2F2
(Psed_4788-4790) (Table 5.2). In contrast, the adjacent homologous glcD1E1F1 gene cluster
was not up-regulated with either dioxane or glycolate. This lack of up-regulation was confirmed
with qRT-PCR (Table 5.3). Therefore, the enzyme encoded by glcD2E2F2 likely catalyzes the
transformation of glycolate to glyoxylate (116). In E. coli, genes for malate synthase G (glcB)
and glcDEF are proximal, oriented in the same direction, and are co-transcribed during growth
with glycolate (117). While glcB (Psed_4782) and glcD2E2F2 are in close proximity in strain
CB1190, they have opposing orientation and are separated by the homologous but non-identical
glcD1E1F1 (Psed_4783-4785) cluster that was not up-regulated during growth with either
dioxane or glycolate (Fig. 5.2). Between glcD2E2F2 and glcD1E1F1 is a gene (Psed_4787)
encoding a putative GntR family transcriptional regulator, which may be involved in regulation
of this glycolate oxidase homologue.

Glyoxylate metabolism during dioxane degradation by strain CB1190

Genes encoding two divergent routes for glyoxylate assimilation were up-regulated with
both dioxane and glycolate, as determined by microarray analysis. The first route was through
the glcB-encoded malate synthase G (Psed_4782) (Table 5.2), although qRT-PCR did not
confirm this up-regulation (Table 5.3). The up-regulated gene cluster encoding glyoxylate
carboligase, tartronate semialdehyde reductase, hydroxypyruvate isomerase and glycerate kinase
(Psed_3888-3891; Fig. 5.2) represents the second route for glyoxylate assimilation. The
up-regulation of three genes from this cluster was verified by qRT-PCR (Table 5.3).

These transcriptional results point to glyoxylate, rather than oxalate (Fig. 5.1), as the key
intermediate in the assimilation of carbon into central metabolism during dioxane degradation by
strain CB1190. In the glyoxylate carboligase pathway, two glyoxylate molecules are combined
by glyoxylate carboligase, producing a C3-compound that is eventually incorporated into
glycolysis at phosphoglycerate (73) (Figs. 5.1 and 5.2). In the typical malate synthase G pathway,
glyoxylate and acetyl-CoA are condensed to form malate, which is incorporated in the TCA
cycle (71) (Figs. 5.1 and 5.2). As noted above, the glyoxylate carboligase pathway supports
growth through carbon and energy conservation, whereas malate synthase G is involved in
anaplerotic reactions but is not known to support growth in the absence of glyoxylate carboligase
with glyoxylate as a sole substrate (110). Therefore, the growth of strain CB1190 on dioxane is
likely dependent on the glyoxylate carboligase pathway, and is potentially enhanced by malate
synthase G.

Figure 5.2. Strain CB1190 chromosomal regions of interest implicated in glycolate and glyoxylate
transformations during dioxane metabolism. A) Gene clusters implicated in glycolate and glyoxylate
metabolism. B) Proposed transformations of glycolate and glyoxylate catalyzed by enzymes putatively
encoded by strain CB1190 gene clusters.

Since strain CB1190 can grow with dioxane, the bacterium must derive energy from the
compound’s transformation. The glyoxylate carboligase pathway represents a potential route to
energy generation during dioxane metabolism via glyoxylate, so we sought to confirm the
activity of this pathway in dioxane-grown cells. Glyoxylate carboligase (Gcl) activity (2
glyoxylate → CO2 + tartronate semialdehyde, Fig. 5.2) was tested in cell-free extracts prepared
from strain CB1190 cells. As Fig. 5.3A shows, pre-growth on dioxane resulted in significantly
greater consumption of glyoxylate than pre-growth on pyruvate. Glyoxylate disappearance was
accompanied by a stoichiometric production of CO2, a specific product of the glyoxylate
carboligase reaction. Heating of the dioxane cell-free extracts for 10 min at 80°C reduced CO2
production to that observed with the buffer treatment (data not shown). No CO2 was detected in
dioxane cell-free extracts when glyoxylate was omitted.

To verify the putative roles of strain CB1190 genes in the dioxane-induced glyoxylate
carboligase pathway, we cloned and over-expressed Psed_3890 (putative glyoxylate carboligase
gene) and Psed_3889 (putative tartronate semialdehyde reductase gene, Fig. 5.2) in Rhodococcus
jostii strain RHA1 using the thiostrepton-inducible plasmid pTip-QC2 (102). As expected,
glyoxylate was consumed and CO2 produced only in cell-free extracts from
Psed_3890-expressing strain RHA1, or when these extracts were mixed with extracts from
Psed_3889-expressing RHA1 (Fig. 5.3B). The mixture of cell-free extracts from RHA1/pTip-Psed_3890 and
RHA1/pTip-Psed_3889 also produced a small amount of glycerate (0.71 ± 0.23 µmol per vial).
No glycerate was produced with cell-free extracts from either construct alone, or with cell-free
extracts from strain RHA1 with empty pTip-QC2. The amount of glycerate detected with the mix
of RHA1/pTip-Psed_3890 and RHA1/pTip-Psed_3889 cell-free extracts represented ~7% of that
expected if tartronate semialdehyde was stoichiometrically transformed to glycerate (Fig. 5.2) as
a dead-end product. Tartronate semialdehyde was not detectable with the HPLC method used, so
it is unclear whether the small amount of glycerate detected was due to only a minor
transformation of tartronate semialdehyde or due to the further transformation of the generated
glycerate to unspecified products. Regardless, these results support the annotation of Psed_3890
as a glyoxylate carboligase gene and Psed_3889 as a tartronate semialdehyde reductase gene.
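
The stoichiometric reasoning behind the "~7% of expected" figure can be sketched in a few lines of Python. The sketch relies only on the reaction stated above (two glyoxylate yield one CO2 and one tartronate semialdehyde, which can give at most one glycerate); the function name is hypothetical, and amounts are per vial in whatever unit the measurement was reported.

```python
def expected_products(glyoxylate_consumed):
    """Return (CO2, maximum glycerate) expected from consumed glyoxylate.

    2 glyoxylate -> 1 CO2 + 1 tartronate semialdehyde, and each
    tartronate semialdehyde can be reduced to at most one glycerate.
    """
    co2 = glyoxylate_consumed / 2.0
    glycerate_max = glyoxylate_consumed / 2.0
    return co2, glycerate_max


glycerate_measured = 0.71    # reported mean amount per vial
fraction_of_expected = 0.07  # "~7%" as stated in the text

# Working backwards from the two reported numbers:
glycerate_expected = glycerate_measured / fraction_of_expected  # about 10.1
glyoxylate_consumed = 2.0 * glycerate_expected                  # about 20.3

print(round(glycerate_expected, 1), round(glyoxylate_consumed, 1))
print(expected_products(glyoxylate_consumed))
```

The back-calculated glyoxylate consumption is only as precise as the rounded "~7%" value, so it should be read as an order-of-magnitude check rather than a measured quantity.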


[Figure 5.3 legend: glyoxylate remaining; CO2 produced]

Figure 5.3. Glyoxylate carboligase activity in strain CB1190 and strain RHA1/pTip cell-free
extracts. A) Phosphate buffer or cell-free extracts from pyruvate- or dioxane-grown strain CB1190 cells
were amended with 10 mM glyoxylate and then glyoxylate remaining and CO2 production were
determined. B) The same assay was performed with cell-free extracts prepared from strain RHA1 cells
with pTip-Psed_3890 (3890; putative glyoxylate carboligase), pTip-Psed_3889 (3889; putative tartronate
semialdehyde reductase), a mix of the two (3890 + 3889), pTip-QC2 (empty vector), or buffer.

Amino  acid  isotopomer  analysis  to  identify  routes  for  dioxane  carbon  assimilation 

The glyoxylate carboligase pathway ultimately leads to the assimilation of carbon as
three-carbon metabolites in glycolysis, as 3-phosphoglycerate and later pyruvate (49). In order to
obtain further support for the hypothesis that carbon from dioxane enters central metabolism via
the glyoxylate carboligase pathway as three-carbon compounds, we performed amino acid
isotopomer analysis with fully 13C-labeled dioxane. Strain CB1190 was grown either with 13C-dioxane
as the sole carbon source or with 13C-dioxane and elevated (3%) CO2. With 13C-dioxane as the
sole carbon source, all of the detected amino acids were heavily labeled, which confirmed, as
expected, that strain CB1190 can directly use dioxane as a sole carbon source.

Pyruvate-derived alanine, valine, phenylalanine and tyrosine were predominantly
13C-labeled when strain CB1190 was grown with 13C-dioxane as the sole carbon source, and the
addition of elevated unlabeled CO2 did not affect these labeling patterns. These results are in
contrast to aspartate, threonine, serine and glycine; for these, a small proportion of unlabeled
carbon was integrated when the cells were grown with 13C-dioxane as the sole carbon source,
and this proportion increased greatly when strain CB1190 was grown with 13C-dioxane and
elevated unlabeled CO2. These results indicate that carbon from CO2 was assimilated during the
synthesis of these amino acids (Fig. 5.4).

Figure 5.4. Proposed assimilation of unlabeled carbon during growth of strain CB1190 with
uniformly 13C-labeled dioxane. Unlabeled carbons are indicated with an asterisk (*); all other carbons
are considered 13C. The dashed arrow indicates a multi-step reaction. The amino acids valine,
phenylalanine and tyrosine were omitted for simplicity.

For the oxaloacetate-derived aspartate and threonine, the unlabeled carbon was present at
a non-first carbon position under both experimental conditions. These results indicate that the
anaplerotic pathway transforming either phosphoenolpyruvate (PEP) or pyruvate to oxaloacetate
through the addition of CO2 (69, 171) was active during dioxane metabolism (Fig. 5.4). Strain
CB1190 has homologous genes for both PEP carboxylase (Psed_6164) and pyruvate carboxylase
(Psed_3032), which catalyze these anaplerotic reactions. The enzyme that catalyzes the reverse
of the PEP-utilizing process, PEP carboxykinase, is putatively encoded by Psed_6619, and this
gene was down-regulated during growth with both dioxane and glycolate, relative to pyruvate.
Taken together, these results indicate that with dioxane, carbon flows predominantly from the
direction of PEP towards the TCA cycle, and support the proposed role of the three-carbon
compound-generating glyoxylate carboligase pathway in dioxane metabolism. In contrast, when
grown with pyruvate, strain CB1190 needs to generate PEP and further gluconeogenic
compounds necessary for amino acid synthesis, which explains the relative upregulation of the
gene (Psed_6619) encoding the PEP-generating PEP carboxykinase.

The inconsistency between the labeling patterns of 3-phosphoglycerate-derived serine
and glycine and pyruvate-derived alanine, valine, phenylalanine and tyrosine is noteworthy,
since 3-phosphoglycerate is a precursor of pyruvate in glycolysis. An explanation for this
inconsistency is an alternative pathway of serine and glycine biosynthesis, specifically, by the
reverse activity of the glycine cleavage system (GCS). The GCS converts glycine and
tetrahydrofolate to CO2, NH3 and methylene-tetrahydrofolate (129), and it is putatively encoded
by the strain CB1190 genome (Chapter 7). The GCS is reversible (67), and reverse GCS activity
in the presence of elevated unlabeled CO2 would result in increased unlabeled carbon in the
glycine pool, and consequently, in the serine pool as well (Fig. 5.4). Thus, during dioxane
metabolism, it is possible that serine and glycine are synthesized from both the glyoxylate
carboligase pathway via 3-phosphoglycerate and the reverse GCS.

Glyoxylate could contribute to the serine pool in yet another way. The strain CB1190
glyoxylate carboligase cluster has a gene (hyi) for hydroxypyruvate isomerase, which catalyzes
the isomerization of hydroxypyruvate to tartronate semialdehyde (6) and which is up-regulated in
strain CB1190 during growth with dioxane (Tables 5.2 and 5.3). Hydroxypyruvate can be used
as a precursor for serine biosynthesis by the activity of transaminases (133). However, no genes
with significant homology to known amino acid-glyoxylate aminotransferase or amino
acid-hydroxypyruvate transaminase genes were up-regulated in strain CB1190 during growth with
dioxane or glycolate, so the role of hydroxypyruvate isomerase in dioxane metabolism remains
unclear.

Energy  generation  in  strain  CB1190  dioxane  metabolism 

Based  on  the  current  work,  we  present  an  updated  proposed  dioxane  degradation  pathway 
for  strain  CB1190,  linking  dioxane  metabolites  to  central  metabolism  (Fig.  5.1).  The  ability  of 
strain  CB1190  to  grow  on  dioxane  necessitates  energy  generation,  and  the  employment  of  the 
glyoxylate  carboligase  pathway  during  dioxane  degradation  conceptually  provides  the  basis  for 
energy  production,  since  this  pathway  results  in  the  pyruvate  precursors  that  can  then  be 
metabolized  to  yield  NADH  and  ATP  (73).  An  analysis  of  the  reducing  equivalents  consumed  or 
produced  in  the  individual  steps  proposed  for  dioxane  metabolism  in  Fig.  5.1  reveals  a  maximum 
net yield of 6 NADH and one GTP per dioxane. Assuming an equivalence of 3 ATP per NADH
and counting the GTP as one ATP, the theoretical maximum yield of ATP generated per dioxane is 19. Mahendra and
Alvarez-Cohen  (89)  previously  reported  a  yield  of  0.09  g  protein  per  g  dioxane  for  strain 
CB1190,  so  assuming  that  protein  constitutes  50%  of  dry  cell  weight  and  integrating  this  cell 
yield with the theoretical maximum yield of ATP from dioxane, this results in a Y_ATP (9) of 0.83
g dry cell weight per mol ATP. In comparison, Lactobacillus casei growing with glucose (128)
and Actinomyces strains growing with lactate (162) had Y_ATP values of 24 and 10 g dry cell
weight  per  mol  ATP,  respectively. 
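
The arithmetic behind these figures can be retraced in a few lines. The following is a minimal sketch, not part of the original analysis: it assumes a molar mass of 88.11 g/mol for dioxane (C4H8O2), counts the GTP as one ATP equivalent, and uses hypothetical variable names.

```python
# Retrace the theoretical ATP yield and Y_ATP estimate for strain CB1190.
NADH_PER_DIOXANE = 6          # maximum net NADH per dioxane (Fig. 5.1)
GTP_PER_DIOXANE = 1           # counted as one ATP equivalent (assumption)
ATP_PER_NADH = 3              # equivalence assumed in the text
PROTEIN_YIELD = 0.09          # g protein per g dioxane (ref. 89)
PROTEIN_FRACTION = 0.5        # assumed protein fraction of dry cell weight
DIOXANE_MOLAR_MASS = 88.11    # g/mol for C4H8O2 (assumption, not given in the text)

# ATP equivalents per dioxane: NADH converted to ATP, plus the GTP.
atp_per_dioxane = NADH_PER_DIOXANE * ATP_PER_NADH + GTP_PER_DIOXANE

# Dry cell weight formed per mol of dioxane consumed.
dry_weight_per_mol = PROTEIN_YIELD / PROTEIN_FRACTION * DIOXANE_MOLAR_MASS

# Y_ATP: g dry cell weight per mol ATP.
y_atp = dry_weight_per_mol / atp_per_dioxane

print(f"ATP per dioxane: {atp_per_dioxane}")
print(f"Y_ATP: {y_atp:.2f} g dry cell weight per mol ATP")
```

Under these assumptions the sketch reproduces the values quoted above; a different NADH-to-ATP equivalence or protein fraction would change Y_ATP proportionally.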

This  work  represents  the  first  insights  into  the  genetic  basis  for  microbial  dioxane 
metabolism  and  provides  a  more  complete  picture  of  the  pathway  leading  to  carbon  assimilation 
and energy generation during growth on dioxane. It will be useful to determine if dioxane
supports  growth  through  a  similar  pathway  in  the  other  reported  dioxane-metabolizing 
microorganisms,  including  Pseudonocardia  benzenivorans  strain  B5  (89),  Mycobacterium  sp. 
PH-06  (66),  C.  sinensis  (101)  and  Rhodococcus  ruber  strain  219  (13). 
6. Transcriptomics of THF degradation and investigation of monooxygenase
activities in Pseudonocardia dioxanivorans strain CB1190


This  is  a  summary  of  work  published  in: 

1.  Sales,  C.  M.  2012.  Functional  genomics  of  bacterial  degradation  of  the  emerging 
contaminants 1,4-dioxane and N-nitrosodimethylamine (NDMA). Ph.D. thesis. University of
California,  Berkeley. 

2.  Additional  results  presented  here  are  in  publications  in  preparation. 

Introduction 

A  number  of  studies  have  reported  the  ability  of  pure  and  mixed  cultures  of  bacteria  and 
fungi to degrade dioxane aerobically (13, 19, 66, 89, 101, 112, 125, 127, 141, 161, 180) and one
study  reported  anaerobic  degradation  (139).  The  most  common  mechanism  for  aerobic  dioxane 
biodegradation  is  co-metabolism,  by  which  another  substrate  is  used  to  provide  carbon  and 
energy to the microorganism. However, a handful of microorganisms, including the bacteria
Rhodococcus ruber 219 (13), Mycobacterium sp. PH-06 (66), Pseudonocardia dioxanivorans
CB1190 (112), and Pseudonocardia benzenivorans B5 (89), and the fungus Cordyceps sinensis
(101), are capable of growth on dioxane as their sole carbon and energy source. In contrast,
metabolism of the cyclic ether tetrahydrofuran (THF) as a sole carbon and energy source among
bacteria is more common. Many actinobacteria can utilize THF for growth (13, 68, 112, 154, 161), as can some
fungal  isolates  (102,  115,  141). 

The  initial  step  in  the  aerobic  biodegradation  of  both  dioxane  and  THF  is  catalyzed  by  a 
monooxygenase  reaction.  Metabolic  pathways  for  THF  degradation  have  been  proposed 
for  R.  ruber  219  (13),  Pseudonocardia  tetrahydrofuranoxydans  K1  (154), 
and  Graphium  sp.  (141).  Common  among  these  metabolic  pathways  is  the  initial  oxidation  of 
THF  to  2-hydroxytetrahydrofuran  and  the  formation  of  succinate  as  the  downstream 
intermediate. No intermediates of THF degradation by R. ruber 219 have been detected, but the
ability of R. ruber 219 to grow on the hypothetical intermediates γ-butyrolactone and succinate
has been shown (13). In P. tetrahydrofuranoxydans, a gene cluster involved in the utilization of
THF was cloned and sequenced (154). This gene cluster contained a four-subunit
multicomponent monooxygenase, thmADBC, putatively involved in the oxidation of THF to
2-hydroxytetrahydrofuran; a succinate semialdehyde dehydrogenase, sad, responsible for the
conversion of succinic semialdehyde to succinate; and an aldehyde dehydrogenase, aldH. In the
fungus Graphium sp., a cytochrome P450 monooxygenase enzyme was suggested to catalyze the
initial oxidation of THF. The only intermediate of THF degradation detected in Graphium sp., or
any other strain, has been γ-butyrolactone (141). In both P.
tetrahydrofuranoxydans and Graphium sp., the conversion of 2-hydroxytetrahydrofuran to
γ-butyrolactone is proposed to be catalyzed by an alcohol dehydrogenase. Furthermore, the
conversion of γ-butyrolactone to 4-hydroxybutyrate is proposed to be an abiotic process and the
subsequent reaction of 4-hydroxybutyrate to succinic semialdehyde is catalyzed by an alcohol
dehydrogenase. Homologues of the thm cluster from P. tetrahydrofuranoxydans have been found
in Pseudonocardia sp. ENV478 (161), Rhodococcus sp. YYL (173),
and P. dioxanivorans CB1190 (132).
In Chapters 4 and 5 we used genomics, transcriptomics and biochemical assays to gain
insights into dioxane metabolism by strain CB1190. These studies identified a gene cluster
encoding a putative multi-component monooxygenase homologous to the
THF-degradation-associated thmADBC gene cluster from P. tetrahydrofuranoxydans K1 (154) and strain ENV478
(93). Both dioxane and THF degradation pathways are initiated by a monooxygenase, as
discussed above, but in strain CB1190 the genetic basis for THF degradation has not yet been
examined. The purpose of the work in this chapter is therefore two-fold: 1) to determine the genes
involved in THF metabolism in strain CB1190, and 2) to investigate the functional role of the
monooxygenase gene cluster thmADBC in THF and dioxane degradation and in the
transformation of the dioxane metabolism intermediate HEAA.

Methods 

Chemicals 

The potassium salt of β-hydroxyethoxyacetic acid (HEAA) was prepared by CanSyn
Chemical  Corporation  (Toronto,  ON). 

Culture  conditions 

Cells of strain CB1190 were grown in ammonium mineral salts (AMS) liquid medium
(112). The growth substrate (dioxane, THF, isopropanol, succinate, or glucose) was added to the
culture medium to achieve a final concentration of 5 mM. HEAA was added to cultures as a
growth substrate at a final concentration of 1.5 mM. Cultures were incubated aerobically
while  shaking  at  30°C. 

Cell  harvesting  and  RNA  isolation  for  transcriptional  studies 

Strain CB1190 cells were harvested and RNA purified for transcriptomic microarray
analysis  and  quantitative  reverse  transcriptase  PCR  (qRT-PCR)  experiments  using  the  methods 
described  in  Chapter  5. 

Analytical  methods 

Dioxane and THF removal was monitored by injection of liquid samples onto a Varian
3400 gas chromatograph with a flame ionization detector and a 1% AT-1000 on Carbograph 1
packed column (Grace, Columbia, MD). Samples were analyzed isothermally at 170°C, and the
injector  and  detector  temperatures  were  230°C  and  250°C,  respectively. 

The  presence  of  HEAA  in  samples  was  measured  by  liquid  chromatography-tandem  mass 
spectrometry  (LC-MS/MS)  using  a  hydrophilic  interaction  chromatography  (HILIC)  column. 
Samples were filtered with 0.2 µm syringe filters to remove cells. Standards and samples were
measured on an Agilent Technologies 1200 Series LC system equipped with an Agilent
Technologies Zorbax HILIC Plus column (4.6 mm x 100 mm, 3.5 µm) coupled to an Agilent
Technologies  6410  tandem  triple  quadrupole  (QQQ)  mass  spectrometer  (Santa  Clara,  CA).  The 
LC solvents used were Solvent A: aqueous buffer and Solvent B: acetonitrile. The aqueous
buffer was 10 mM ammonium formate in water, adjusted to pH 5 with formic acid and passed
through a 0.45 µm filter. The flow rate was 0.5 mL/min. The gradient was: t = 0 min., 95% B; t = 10 min.,
40%  B;  t  =  13  min.,  95%  B;  t  =  15  min.,  95%  B.  The  injection  volume  for  samples  and  standards 
was 10 µL. Column effluent was introduced into the electrospray chamber, where the
electrospray ionization was set to 3000 V in negative mode (ESI−). Nitrogen was used as the
nebulizing gas at 30 psi, 325°C, and a gas flow of 11 L/min. Multiple reaction monitoring
(MRM) was used to monitor the transitions from the parent ion of HEAA, m/z 119 [C4H7O4]−, to
the major product ions of m/z 119 [C4H7O4]−, m/z 101 [C4H5O2]−, m/z 75 [C2H3O3]−, and m/z 31
[CH3O]− during a run. The collision gas used for MRM was argon, with a collision energy of 6
V and a fragmentor voltage of 70 V for all MRM transitions. The summation of the peak areas for all
MRM  transitions  was  used  for  quantification  of  standards  and  samples. 
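
For reference, the solvent gradient listed above can be written as a small lookup. The sketch below assumes linear ramps between the listed set points, which is common for LC pumps but is not stated in the text; the function name `percent_b` is hypothetical.

```python
# Gradient set points from the text: (time in min, % Solvent B).
GRADIENT = [(0.0, 95.0), (10.0, 40.0), (13.0, 95.0), (15.0, 95.0)]


def percent_b(t_min):
    """Return % Solvent B at time t_min, assuming linear ramps between set points."""
    if t_min < GRADIENT[0][0] or t_min > GRADIENT[-1][0]:
        raise ValueError("time outside the programmed gradient")
    # Find the segment containing t_min and interpolate within it.
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


for t in (0, 5, 10, 11.5, 15):
    print(t, percent_b(t))
```

Solvent A makes up the remainder at each time point, so the aqueous fraction peaks at 60% at t = 10 min before the column is returned to starting conditions.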

Transcriptomics microarray analysis

Five µg of total RNA was used as starting material for each microarray chip, and the chips were
processed using the protocol described in Chapter 5. All microarray data analyses were
performed in the R statistical programming environment (www.r-project.org) using packages
available from Bioconductor version 2.9 (www.bioconductor.org) (Gentleman et al., 2004).
Hybridization signal intensities for probe sets were calculated using the “rma” function from the
“affy” package (41). The “rma” function implements the computation of the RMA expression
measure (57), which summarizes the probe intensities for each probe set by background
adjustment, quantile normalization (15), and the median polish linear fitting procedure.
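
To make the normalization step more concrete, the sketch below illustrates quantile normalization on a toy matrix in plain Python. It is not the “rma” implementation, which was run in R and also performs background adjustment and median polish. It ignores tied values, and the intensities are made-up numbers.

```python
# Toy illustration of quantile normalization (one of the three RMA steps).
def quantile_normalize(columns):
    """columns: list of equal-length lists, one per array. Returns normalized copies."""
    n = len(columns[0])
    sorted_cols = [sorted(col) for col in columns]
    # Reference distribution: mean of the k-th smallest value across arrays.
    reference = [sum(col[k] for col in sorted_cols) / len(columns) for k in range(n)]
    normalized = []
    for col in columns:
        # Indices of this array's values ordered from smallest to largest.
        order = sorted(range(n), key=lambda i: col[i])
        out = [0.0] * n
        for rank, i in enumerate(order):
            out[i] = reference[rank]  # replace each value by the reference at its rank
        normalized.append(out)
    return normalized


arrays = [[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]]  # hypothetical probe intensities
result = quantile_normalize(arrays)
print(result)
```

After normalization every array has the same set of values, so differences between arrays reflect only the ranking of probes within each array.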

Identification  of  differentially  expressed  genes  was  accomplished  using  the  “limma” 
package  (143),  which  implements  a  linear  models  approach  to  analyzing  designed  microarray 
experiments.  This  approach  first  required  fitting  the  microarray  data  to  a  linear  model,  specified 
by  a  design  matrix  that  indicates  which  RNA  samples  have  been  applied  to  each  array,  using  the 
function  “lmFit”.  Since  the  expression  data  was  from  Affymetrix  GeneChips,  the  linear  modeling 
was  the  same  as  one-way  analysis  of  variance  (ANOVA).  The  second  step  in  this  approach 
involved  specifying  the  comparisons  to  be  made  between  groups  of  arrays  in  a  contrast  matrix, 
such as contrasting all treatments (dioxane, glycolate, THF, and succinate) to the control
(pyruvate) and direct comparison between treatments (e.g., dioxane to glycolate or THF to
succinate). This contrast matrix was applied to the linear fitted microarray data using the
function “contrasts.fit”. Estimated log2-fold changes (log2FC) and statistics used to assess
differential expression between contrasts were determined using an empirical Bayes method,
implemented in the function “eBayes”. The P values returned from linear modeling were
adjusted to correct for multiple hypothesis testing by applying the Benjamini and Hochberg
procedure (11). This procedure allowed for the control of the false discovery rate (FDR) at 1%
(i.e., by only considering contrasts with an adjusted p-value less than 0.01). Genes that met the
criteria of FDR < 1% and |log2FC| > 1 were considered differentially expressed.
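
The final filtering step can be sketched as follows. This is an illustrative plain-Python version of the Benjamini and Hochberg adjustment followed by the two cut-offs stated above. It is not the limma code used in the study, and the gene names and statistics are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def differentially_expressed(log2fc, adj_p, fc_cutoff=1.0, fdr_cutoff=0.01):
    """Apply the |log2FC| > 1 and FDR < 1% criteria."""
    return abs(log2fc) > fc_cutoff and adj_p < fdr_cutoff


# Hypothetical genes with made-up statistics.
genes = ["gene_a", "gene_b", "gene_c", "gene_d"]
log2fc = [2.5, -1.8, 0.4, 1.2]
p_raw = [0.0001, 0.004, 0.002, 0.03]

adj = benjamini_hochberg(p_raw)
hits = [g for g, fc, p in zip(genes, log2fc, adj) if differentially_expressed(fc, p)]
print(adj)
print(hits)
```

In this toy example gene_c passes the FDR cut-off but fails the fold-change cut-off, and gene_d does the opposite, so neither is reported.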

Quantitative  RT-PCR  analyses 

Total RNA was isolated from strain CB1190 cells after 8 hours of exposure to either 5
mM of dioxane, 5 mM of THF, 20% (vol/vol of total headspace) of propane, or 5 mM of
glucose (control). Cells for induction analysis were grown in AMS medium with 5 mM of
glucose and harvested by filtration. The harvested cells were washed with and resuspended in
AMS medium. The resuspended cells were aliquoted so each treatment and control condition
was  tested  in  triplicate.  The  isolated  RNA  was  used  to  synthesize  cDNA  using  the  TaqMan 
reverse  transcription  kit  (Applied  Biosystems).  TaqMan  chemistry  was  used  for  qPCR  reactions 
targeting  thmA  (Psed_6976),  prmA  (Psed_0639),  tpi  (Psed_3417),  thiC  (Psed_6168),  and  rpoD 
(Psed_0376).  The  genes  tpi,  thiC,  and  rpoD  were  determined  by  the  geNorm  method  (163)  to  be 
stably  transcribed  across  all  treatments,  and  were  thus  used  as  internal  references.  Each  qPCR 
reaction, which was run in triplicate for each biological replicate on an Applied Biosystems
StepOne Plus real-time PCR system, consisted of 1X Fast Universal Mix (Applied Biosystems),
2 µL of diluted cDNA, each primer at 0.5 mM, and the probe at 145 nM. The cycling conditions
were: 95°C for 20 s, then 40 cycles of 95°C for 1 s followed by annealing for 20 s. The
efficiency of each qPCR assay was determined with a serial dilution of cDNA derived from a
dioxane treatment replicate. Gene expression was normalized using the method of Vandesompele
et  al.  (163). 
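
The idea behind normalizing to several reference genes can be sketched as below: the target gene quantity is divided by the geometric mean of the reference gene quantities. This is a simplified illustration rather than the analysis actually performed. The Ct values are made up, the amplification efficiency is assumed to be 2.0 for every assay (the study determined efficiencies from serial dilutions), and the function names are hypothetical.

```python
import math


def relative_quantity(ct, ct_calibrator, efficiency=2.0):
    """Quantity relative to a calibrator sample; efficiency 2.0 means perfect doubling."""
    return efficiency ** (ct_calibrator - ct)


def normalized_expression(target_q, reference_qs):
    """Divide the target quantity by the geometric mean of the reference quantities."""
    factor = math.exp(sum(math.log(q) for q in reference_qs) / len(reference_qs))
    return target_q / factor


# Hypothetical Ct values: (control, treatment) for the target and three references.
ct = {
    "thmA": (28.0, 24.0),
    "tpi": (20.0, 20.0),
    "thiC": (22.0, 21.0),
    "rpoD": (21.0, 22.0),
}

# The control sample is the calibrator, so its normalized expression is 1 and
# the treatment value can be read directly as a fold change.
target_q = relative_quantity(ct["thmA"][1], ct["thmA"][0])
reference_qs = [relative_quantity(ct[g][1], ct[g][0]) for g in ("tpi", "thiC", "rpoD")]
fold_change = normalized_expression(target_q, reference_qs)

print(f"fold change: {fold_change:.2f}, log2FC: {math.log2(fold_change):.2f}")
```

Using the geometric mean keeps a single drifting reference gene from dominating the normalization factor, which is the motivation for selecting several stably transcribed references.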

thm gene cluster transcriptional analysis

Cells  for  RT-PCR  analysis  of  the  THF  monooxygenase  genes  thmADBC  were  grown  on 
dioxane,  isopropanol,  or  THF  and  harvested  by  centrifugation.  RNA  was  isolated  using  the 
RNeasy  kit.  RT-PCR  was  performed  using  the  Qiagen  One  Step  RT-PCR  kit.  The  primers  used 
for amplification of each fragment were: fragment A, thmA_For1
5’-ATGACTGCCCCACCGATGAAG-3’ and thmA_Rev
5’-AACGGTAGTAGTCGTCATTCACC-3’; fragment B, thmA_For2
5’-CCTATAAGCGAGTGGAATAC-3’ and thmD_Rev
5’-CGATGAGTAACGCCGAACGACTC-3’; fragment C, thmD_For
5’-GCTGCGCCAGATGTCAGAGG-3’ and thmC_Rev1
5’-CCACATAATCATAAGCGACGTCG-3’; and fragment D, thmB_For
5’-CTCATGAGCGCGAGATCGAG-3’ and thmC_Rev2 5’-CCACATAATCATAAGCGACGTCG-3’.
RT-PCR products were visualized on an electrophoresis gel.

Cloning  and  expression  of  strain  CB1190  THF  monooxygenase  genes  in  R.  jostii  strain 
RHA1 


The 4.3 kb fragment of strain CB1190 genes Psed_6976-6979 was initially PCR-amplified
and cloned into the plasmid pK18 (121). The Psed_6976-6979 fragment encoding
the THF monooxygenase genes thmADBC was then subcloned into plasmid pTip-QC2 (102). The
fragment was amplified with forward primer
5’-AAGGAGATATACATATGACTGCCCCACCGATGAA-3’ and reverse primer
5’-GTATGCGGCCGCCATGGAATTCTACGACTCAGAGTTGATCAGCTCGAT-3’, where the
gene-binding sequence is underlined and the NdeI and XhoI restriction sites, respectively, are in
bold. Each 100 µL PCR reaction consisted of 1X HF Buffer, 200 nM dNTPs, 500 nM each
primer, 3% DMSO, 2 units Phusion Hot Start II DNA polymerase (Finnzymes) and 10 ng of
plasmid pT7-7/thfmo DNA. Thermocycling conditions were as follows: 98°C for 3 min, then 30
cycles of (98°C for 20 s, 72°C for 6 min), then 72°C for 10 min. The appropriate amplicon was
then  gel-purified. 

Plasmid pTip-QC2 was linearized with NdeI and EcoRI (New England Biolabs), gel
purified, and then the plasmid and PCR insert were ligated at a 1:3 (plasmid:insert) ratio at 16°C
overnight with T4 DNA ligase (New England Biolabs). Electrocompetent E. coli DH5α was
transformed via electroporation with 1 µL of ligation mix. Ampicillin-resistant colonies were
screened for the appropriate construct, which was named pTip-thfmo.

Fifty ng of purified plasmid pTip-thfmo or empty vector pTip-QC2 were used to
transform electrocompetent Rhodococcus jostii strain RHA1 (strain RHA1) according to the
method of Kalscheuer et al. (61), except that cells were not pre-incubated prior to
electroporation, and the electroporation parameters were: 12 kV/cm, 800 Ω, and 25 µF.

Single  colonies  of  strain  RHA1  containing  the  plasmid  pTip  constructs  were  used  to 
inoculate  nutrient  broth  amended  with  chloramphenicol  and  grown  at  30°C  with  shaking  for  48 
h.  Two  liter  flasks  containing  0.5  L  of  nutrient  broth  amended  with  chloramphenicol  and  1% 
each  of  sucrose  and  glycine  were  inoculated  with  10  mL  of  48  h  culture  and  incubated  at  30°C 
with shaking. When the cultures had reached OD600 of 0.8 (~21 h), thiostrepton dissolved in
DMSO was added to a final concentration of 1 µg/mL and the cultures were incubated for a
further 26 h. Cells were transferred to centrifuge bottles (Beckman Coulter, Brea, CA), cooled on
ice, pelleted by centrifugation, and then cell pellets were washed twice with ice-cold buffer (20
mM sodium phosphate, pH 7.0, 0.05% Tween-80). The cells were finally resuspended in ice-cold
buffer, aliquoted to 1.5 mL microcentrifuge tubes, and then stored at −80°C until use in
enzyme  assays. 

Aliquots  of  strain  RHA1  carrying  plasmid  pTip  constructs  were  thawed  and  resuspended 
in 1 mL of cold buffer. The transformation of dioxane and THF and production of HEAA by cell
suspensions of RHA1/pTip-thfmo or RHA1/pTip-QC2 was tested in 2 mL microcentrifuge tubes
containing 1 mL of buffer and cell suspension. Each triplicate tube received 100 µL of cell
suspension, and was amended with dioxane (2.5 mM), THF (4.0 mM) or HEAA (10 mM) from
aqueous stocks. Buffer control vials (no cell suspension) were prepared in parallel. Vials were
incubated at 30°C with horizontal shaking. At several timepoints, 200 µL were removed and
analyzed for the disappearance of the amended compound. GC-FID (for dioxane and THF) and
LC-MS/MS (for HEAA) were used for analyzing samples.

HEAA  transformation  inhibition  assay 

For  assays  to  test  inhibition  of  HEAA  degradation  by  acetylene,  strain  CB1190  was 
grown  with  dioxane  and  harvested  by  filtration.  Cells  were  washed  on  the  filter  with  phosphate 
buffer  to  remove  residual  dioxane,  and  then  scraped  cells  were  resuspended  in  phosphate  buffer. 
One  half  of  the  cell  suspension  was  exposed  briefly  to  5%  (vol/vol)  acetylene  in  the  headspace 
for 10 minutes while shaking at 30°C. N2 gas was then used to remove residual acetylene. The
non-acetylene-exposed half of the cell suspension was treated identically, without the addition of
acetylene. Finally, HEAA was added to the cell suspensions and abiotic controls and the
concentration  of  HEAA  in  each  suspension  was  monitored  over  time. 

Results  and  Discussion 

Transcriptomics of P. dioxanivorans strain CB1190 growth on THF and succinate

Succinate  is  an  intermediate  of  aerobic  THF  degradation  in  the  bacterium  P. 
tetrahydrofuranoxydans  (154)  and  the  fungus  Graphium  sp.  (141).  Therefore,  in  order  to 
determine the genes involved in the degradation of THF in strain CB1190, the transcriptomes
during THF and succinate growth were also compared directly and relative to pyruvate. The
transcriptomes of THF- and succinate-grown strain CB1190 were determined by microarray
analysis. When the THF- and succinate-grown transcriptomes were individually contrasted
against the pyruvate-grown transcriptome, 717 and 580 genes were differentially expressed,
respectively.  Growth  on  THF  versus  pyruvate  led  to  the  up-regulation  of  224  genes  and  the 
down-regulation  of  493  genes.  Growth  on  succinate  versus  pyruvate  led  to  the  up-regulation  of 
377  genes  and  the  down-regulation  of  203  genes. 

A total of 830 genes were differentially expressed on THF versus succinate, of which
220 were up-regulated and 610 were down-regulated. A total of 92 genes were up-regulated on
THF and not on succinate, both relative to pyruvate (Table 6.1).
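
The bookkeeping behind these counts can be sketched as follows. The first part checks that the reported up- and down-regulated counts add up to the reported totals. The second part illustrates the "up on THF but not on succinate" selection using three log2FC pairs taken from Table 6.1 plus one hypothetical gene that is up-regulated under both conditions. The FDR criterion is assumed to be met wherever log2FC exceeds 1, since adjusted p-values are not shown.

```python
# Reported (up, down, total) gene counts for each contrast.
reported = {
    "THF vs pyruvate": (224, 493, 717),
    "succinate vs pyruvate": (377, 203, 580),
    "THF vs succinate": (220, 610, 830),
}
for contrast, (up, down, total) in reported.items():
    assert up + down == total, contrast

# log2FC relative to pyruvate as (THF, succinate).
log2fc = {
    "Psed_0131": (1.99, 0.60),            # Table 6.1
    "Psed_4939": (6.70, -0.18),           # Table 6.1
    "Psed_6982": (3.46, 0.25),            # Table 6.1
    "example_shared_gene": (2.00, 1.50),  # hypothetical, up under both conditions
}


def up_regulated(value, threshold=1.0):
    """True when log2FC exceeds the threshold (FDR assumed to pass)."""
    return value > threshold


thf_only = sorted(
    gene
    for gene, (thf, succinate) in log2fc.items()
    if up_regulated(thf) and not up_regulated(succinate)
)
print(thf_only)
```

The hypothetical shared gene is excluded because it also exceeds the threshold on succinate, which is the behavior that defines the 92-gene set.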
Table 6.1. Genes up-regulated on THF but not succinate, relative to pyruvate.

Up-regulated genes have a log2FC > 1 and FDR (adjusted p-value) < 0.01 (not shown). [Genes highlighted in
red are not differentially expressed (|log2FC| > 1 and FDR < 0.01) when comparing the transcriptomes of THF-
and succinate-grown strain CB1190 directly.]

Gene       Protein                                                       THF vs. pyruvate  Succinate vs. pyruvate
                                                                         (log2FC)          (log2FC)

Psed_0094  YCII-related                                                   2.46              0.77
Psed_0095  RNA polymerase sigma factor, sigma-70 family                   1.52              0.21
Psed_0129  protein of unknown function DUF59                              1.62              0.67
Psed_0130  amidohydrolase 2                                               2.17              0.83
Psed_0131  Alcohol dehydrogenase GroES domain protein                     1.99              0.60
Psed_0288  S-(hydroxymethyl)glutathione dehydrogenase                     1.76              0.18
Psed_0355  hypothetical protein                                           1.26             -0.41
Psed_0357  helix-turn-helix domain protein                                1.08              0.06
Psed_0382  hypothetical protein                                           1.16              0.84
Psed_0554  hypothetical protein                                           2.06              0.59
Psed_0934  YCII-related                                                   2.58             -0.16
Psed_1301  Redoxin domain protein                                         1.03             -0.05
Psed_1303  Heavy metal transport/detoxification protein                   1.05             -0.72
Psed_1584  6-phosphofructokinase                                          1.05             -0.49
Psed_1967  major facilitator superfamily MFS_1                            1.85              0.09
Psed_1968  hypothetical protein                                           2.11              0.27
Psed_2321  ABC-type sugar transport system periplasmic component-like     1.22             -0.29
Psed_3256  hypothetical protein                                           3.34              0.99
Psed_3442  protein of unknown function DUF939                             1.05              0.74
Psed_3564  Aromatic-amino-acid transaminase                               1.66              0.47
Psed_3652  hypothetical protein                                           1.52             -0.25
Psed_3669  hypothetical protein                                           2.28              0.66
Psed_3986  Peptidase M23                                                  1.05              0.54
Psed_4083  hypothetical protein                                           1.23              0.80
Psed_4236  hypothetical protein                                           1.03             -0.01
Psed_4268  membrane protein of unknown function UCP014873                 2.06              0.12
Psed_4321  hypothetical protein                                           1.55             -0.63
Psed_4322  Dihydrolipoyl dehydrogenase                                    2.21             -0.42
Psed_4699  protein of unknown function DUF1365                            1.62              0.83
Psed_4702  protein of unknown function DUF1295                            3.20              0.52
Psed_4753  nitrogen regulatory protein P-II                               1.15              0.06
Psed_4754  nitrogen regulatory protein P-II                               1.34              0.41
Psed_4937  hypothetical protein                                           1.06             -0.61
Psed_4938  regulatory protein MarR                                        2.83             -0.16
Psed_4939  Lipase                                                         6.70             -0.18
Psed_4940  fumarylacetoacetate (FAA) hydrolase                            5.21             -0.01
Psed_4941  Formyl-CoA transferase                                         2.78              0.02
Psed_5082  YVTN beta-propeller repeat-containing protein                  1.20              0.23
Psed_5176  hypothetical protein                                           1.61              0.66
Psed_5224  hypothetical protein                                           1.19             -0.55
Psed_5245  regulatory protein TetR                                        1.13             -0.15
Psed_5246  uncharacterized peroxidase-related enzyme                      1.64              0.01
Psed_5247  hypothetical protein                                           2.10             -0.13
Psed_5248  Nitroreductase                                                 4.37             -0.04
Psed_5249  NADPH:quinone reductase                                        3.45             -0.25
Psed_5250  major facilitator superfamily MFS_1                            1.04              0.10
Psed_5736  Transglycosylase-like domain protein                           2.30              0.94
Psed_5747  hypothetical protein                                           1.83              0.53
Psed_5748  hypothetical protein                                           1.10              0.58
Psed_5749  hypothetical protein                                           1.31              0.70
Psed_5752  hypothetical protein                                           1.04              0.32
Psed_5753  hypothetical protein                                           1.73              0.29
Psed_5755  hypothetical protein                                           1.21              0.66
Psed_5756  hypothetical protein                                           1.16              0.61
Psed_5757  hypothetical protein                                           1.79              0.18
Psed_5758  hypothetical protein                                           2.27              0.60
Psed_5759  hypothetical protein                                           2.39              0.45
Psed_5760  hypothetical protein                                           2.22              0.26
Psed_5761  hypothetical protein                                           2.02              0.61
Psed_5762  hypothetical protein                                           1.63              0.33
Psed_5763  hypothetical protein                                           1.35              0.35
Psed_5764  hypothetical protein                                           1.31              0.22
Psed_5765  hypothetical protein                                           1.01              0.21
Psed_5766  hypothetical protein                                           1.43              0.53
Psed_5767  hypothetical protein                                           1.50              0.14
Psed_5768  hypothetical protein                                           1.43              0.56
Psed_5769  hypothetical protein                                           1.55              0.19
Psed_5770  hypothetical protein                                           1.29              0.29
Psed_5778  cell division FtsK/SpoIIIE                                     1.33              0.12
Psed_5779  hypothetical protein                                           2.01              0.08
Psed_5780  hypothetical protein                                           1.37              0.78
Psed_5781  hypothetical protein                                           1.29              0.81
Psed_6160  heavy metal translocating P-type ATPase                        1.24              0.92
Psed_6249  Ribulose bisphosphate carboxylase large chain                  1.15             -0.20
Psed_6665  NLP/P60 protein                                                2.19              0.27
Psed_6780  hypothetical protein                                           1.57             -0.10
Psed_6806  hypothetical protein                                           1.31             -0.12
Psed_6807  hypothetical protein                                           2.79             -0.11
Psed_6812  hypothetical protein                                           1.37              0.92
Psed_6847  hypothetical protein                                           1.11              0.08
Psed_6857  protein of unknown function DUF156                             1.03              0.72
Psed_6879  S-adenosylmethionine synthase                                  1.82             -0.23
Psed_6970  D-lactate dehydrogenase (cytochrome)                           2.71             -0.20
Psed_6971  Hydroxyacid-oxoacid transhydrogenase                           2.17              0.44
Psed_6972  GntR domain protein                                            1.66              0.31
Psed_6974  Ethyl tert-butyl ether degradation EthD                        1.82              0.92
Psed_6975  Betaine-aldehyde dehydrogenase                                 1.63              0.73
Psed_6977  Ferredoxin--NAD(+) reductase                                   2.48              0.70
Psed_6978  methane/phenol/toluene hydroxylase                             1.59              0.54
Psed_6979  monooxygenase component MmoB/DmpM                              2.23              0.68
Psed_6981  Aldehyde Dehydrogenase                                         3.35              0.88
Psed_6982  Mn2+/Fe2+ transporter, NRAMP family                            3.46              0.25

The thm gene cluster (Psed_6974 to Psed_6982) located on plasmid pPSED02 is found in
the set of genes up-regulated on THF but not succinate, relative to pyruvate. The most highly
up-regulated gene in the thm cluster is the Mn2+/Fe2+ transporter. Another gene (Psed_6971) located
just upstream of the thm cluster on plasmid pPSED02 is also up-regulated. The gene Psed_6971
is annotated to encode a hydroxyacid-oxoacid transhydrogenase, which is known to convert
γ-hydroxybutyrate to succinic semialdehyde, using α-ketoglutarate as a hydrogen acceptor
(Kaufmann et al., 1988). Other oxoacids, including oxaloacetate and α-ketoadipate, can also be
used as a hydrogen acceptor but with lower activity. Psed_6970 is also up-regulated on THF
versus succinate, relative to pyruvate, and encodes a lactate dehydrogenase, a homologue of
γ-hydroxybutyrate dehydrogenase, which converts γ-hydroxybutyrate to succinic semialdehyde
using NADH.

Transcriptomic microarray analysis of THF- and succinate-grown strain CB1190 cells
agreed with the Northern blot analysis reported for P. tetrahydrofuranoxydans that showed genes
in the thm cluster to be expressed during THF utilization (154). Although a probe set for the THF
monooxygenase oxygenase component α-subunit thmA is not on the microarray, RT-qPCR
shows that THF induces gene expression of thmA by 10.2-fold (log2FC = 3.35), relative to
glucose. The only disagreement between the Northern blot analysis performed with P.
tetrahydrofuranoxydans and the transcriptional analyses performed with strain CB1190 is that
the aldehyde dehydrogenase gene (Psed_6981), which has been proposed to be involved in the
conversion of 4-hydroxybutyraldehyde to 4-hydroxybutyrate in P. tetrahydrofuranoxydans and
Graphium sp. (141), is up-regulated during growth on THF in strain CB1190 and not in P.
tetrahydrofuranoxydans. An alcohol dehydrogenase had been previously proposed to catalyze
the conversion of 2-hydroxytetrahydrofuran to γ-butyrolactone (141). Among the genes only
up-regulated on THF relative to pyruvate is one annotated as an alcohol dehydrogenase GroES
domain protein (Psed_0131). Although the activity of this alcohol dehydrogenase needs to be
confirmed, this is the first evidence of induced alcohol dehydrogenase gene expression during
THF degradation. The conversion of 4-hydroxybutyrate to succinate semialdehyde is proposed to
be catalyzed by the product of at least one of two differentially expressed genes on the plasmid
pPSED02, a lactate dehydrogenase (Psed_6970) or hydroxyacid-oxoacid transhydrogenase
(Psed_6971).
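
Because the RT-qPCR result for thmA is quoted as a fold change while the microarray results are quoted as log2FC, it can help to convert between the two scales. The helper names below are hypothetical. The two inputs are the thmA induction quoted above and the log2FC of the Mn2+/Fe2+ transporter (Psed_6982) from Table 6.1.

```python
import math


def fold_to_log2(fold):
    """Convert a fold change to log2 fold change."""
    return math.log2(fold)


def log2_to_fold(log2fc):
    """Convert a log2 fold change back to a fold change."""
    return 2.0 ** log2fc


# thmA induction by THF relative to glucose (RT-qPCR).
print(round(fold_to_log2(10.2), 2))

# Mn2+/Fe2+ transporter (Psed_6982), THF vs. pyruvate (microarray).
print(round(log2_to_fold(3.46), 1))
```

On this scale the thmA induction measured by RT-qPCR is of similar magnitude to the strongest induction seen by microarray within the thm cluster, although the two values use different reference conditions (glucose and pyruvate) and are therefore not directly comparable.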
Therefore, based on transcriptomic microarray analysis, the proposed THF degradation
pathway consists of (i) the hydroxylation of THF to 2-hydroxytetrahydrofuran by the THF
monooxygenase (thmADBC), (ii) conversion of 2-hydroxytetrahydrofuran to γ-butyrolactone by
the alcohol dehydrogenase (Psed_0131), (iii) abiotic hydrolysis of γ-butyrolactone to produce
4-hydroxybutyrate, (iv) conversion of 4-hydroxybutyrate to succinate semialdehyde via either
transhydrogenase (Psed_6971) or dehydrogenase (Psed_6970), (v) the oxidation of succinate
semialdehyde to succinate by succinate semialdehyde dehydrogenase (sad), and (vi) entry of
succinate into the tricarboxylic acid cycle. The 4-hydroxybutyrate could potentially be polymerized
to poly-beta-hydroxybutyrate (PHB) since two poly-beta-hydroxybutyrate polymerase genes
(Psed_2979 and Psed_2990) exist on the strain CB1190 chromosome.

Induction  of  thmA  gene  expression  by  dioxane  and  THF 

The microarray used in this study is missing a number of genes from the finished strain
CB1190 genome and does not include sequences determined to be pseudogenes. The sequence
Psed_6976 was designated as a pseudogene and was therefore left off the microarray.
However, sequence analysis of the thm gene cluster determined that Psed_6976 shares 98.9%
amino acid identity, 94% nucleotide identity, and the same gene order with the tetrahydrofuran
monooxygenase α-subunit sequence, thmA, characterized in P. tetrahydrofuranoxydans. RT-PCR
analysis of the THF monooxygenase genes thmADBC (Fig. 6.1) indicated that all of the genes in
this cluster were transcribed during growth on isopropanol, dioxane, or THF. Amplification of
fragment A demonstrated that Psed_6976 is not a pseudogene but an expressible thmA gene. The
weak amplification of fragments B and C, which span thmAD and thmDBC, respectively, suggested
that the reductase component (thmD) is weakly co-transcribed with its neighboring open reading
frames. The presence of fragment D shows that the two genes thmB and thmC were co-transcribed.

Figure 6.1. RT-PCR analysis of THF monooxygenase gene expression.

Lanes 1-3, primers thmA_For1 and thmA_Rev were used to amplify fragment A. Lanes 4-6, primers
thmA_For2 and thmD_Rev were used to amplify fragment B. Lanes 7-9, primers thmD_For and
thmC_Rev1 were used to amplify fragment C. Lanes 10-12, primers thmB_For and thmC_Rev2 were
used to amplify fragment D. RNA used in reactions shown in lanes 1, 4, 7, and 10 was purified from
isopropanol-grown cells, RNA used in reactions shown in lanes 2, 5, 8, and 11 was purified from
dioxane-grown cells, and RNA used in reactions shown in lanes 3, 6, 9, and 12 was purified from
THF-grown cells. Lanes M, molecular weight markers. No products were detected in reactions lacking reverse
transcriptase. 

qRT-PCR  analysis  had  been  used  to  quantify  the  up-regulation  of  thmA  (Psed_6976) 
during  growth  on  dioxane  (Chapter  5).  Here,  qRT-PCR  was  used  to  quantify  the  induction  of 
thmA  gene  expression  after  8  hours  of  exposure  to  dioxane,  THF,  or  propane.  In  cells  exposed  to 
dioxane,  THF,  and  propane,  thmA  was  induced  24,  10,  and  6  fold,  respectively,  relative  to 
glucose (Fig. 6.2). A similar analysis was performed for prmA (Psed_0629), encoding the propane
monooxygenase α-subunit. Exposure to dioxane and THF did not lead to significant differential
expression of prmA, with fold-differential expression values of 2 and 1, respectively. However,
propane significantly induced the prmA gene (64-fold relative to glucose).


Figure  6.2.  Induction  of  thmA  gene  expression  in  glucose-grown  strain  CB1190. 

Glucose-grown  cells  were  exposed  to  dioxane,  THF,  or  propane  and  after  8  hours  of  exposure,  RNA  was 
extracted  and  transcripts  were  quantified  using  qRT-PCR.  Fold-differential  expression  values  were 
calculated  relative  to  the  expression  of  cells  exposed  to  glucose  (control)  and  were  normalized  to 
housekeeping  genes. 
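
The caption above describes fold-differential expression calculated relative to a glucose control and normalized to housekeeping genes. One common way to compute such values is the 2^-ΔΔCt method; the report does not state which calculation was used, so the Python sketch below is only an illustration under that assumption. The function name and all Ct values are hypothetical and are not data from this study.

```python
def fold_change_ddct(ct_target_treated, ct_ref_treated,
                     ct_target_control, ct_ref_control):
    """Relative expression by the 2^-ddCt method (assumes ~100% PCR efficiency)."""
    # Normalize the target gene to the housekeeping (reference) gene in each condition.
    dct_treated = ct_target_treated - ct_ref_treated
    dct_control = ct_target_control - ct_ref_control
    # Compare the treated condition to the control condition.
    ddct = dct_treated - dct_control
    return 2.0 ** (-ddct)


# Hypothetical Ct values, NOT data from this report.
example = fold_change_ddct(
    ct_target_treated=20.0,  # target gene in substrate-exposed cells
    ct_ref_treated=15.0,     # housekeeping gene, same sample
    ct_target_control=24.0,  # target gene in the glucose control
    ct_ref_control=15.0,     # housekeeping gene, control sample
)
print(f"fold induction: {example:.1f}")  # fold induction: 16.0
```

A target gene that crosses threshold four cycles earlier than in the control, with an unchanged reference gene, corresponds to a 16-fold induction under these assumptions.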

Functional activity of thmADBC-expressing R. jostii RHA1 clones

The strain CB1190 gene cluster thmADBC encoding the putative THF multicomponent
monooxygenase was cloned and expressed in Rhodococcus jostii strain RHA1 from the
Rhodococcus expression vector pTip-QC2 (RHA1/pTip-thfmo). The RHA1/pTip-thfmo clones
were grown in nutrient broth and expression was induced by thiostrepton. Monooxygenase
activity was tested with dioxane. RHA1/pTip-thfmo transformed 2.5 mM of dioxane into HEAA
within 3 days (Fig. 6.3). The generation of HEAA was stoichiometric with dioxane removal.
Dioxane was not removed in either the abiotic controls or the RHA1 clone containing the empty
vector pTip-QC2 (RHA1/pTip-QC2). HEAA was detectable at extremely
low concentrations in the abiotic and pTip-QC2 controls, indicating that HEAA is a minor
impurity in the dioxane stock from the manufacturer.

[Figure 6.3 legend: 1,4-dioxane in the abiotic control, pTip control, and THFMO clone; HEAA in the abiotic control and THFMO clone.]


Figure  6.3.  Transformation  of  dioxane  to  HEAA  by  heterologous  strain  CB1190  monooxygenase. 

RHA1/pTip-thfmo, RHA1/pTip-QC2 empty vector control, or abiotic control was amended with dioxane,
and dioxane disappearance and HEAA production were monitored. Error bars represent the range of
duplicate  reactions. 

The RHA1/pTip-thfmo expression clone also removed THF, with 4 mM of THF
disappearing within 3 days (Fig. 6.4). THF loss in the abiotic and pTip-QC2 controls was
minimal and was attributable to the volatility of THF.

Figure 6.4. Removal of THF by heterologous strain CB1190 monooxygenase.

RHA1/pTip-thfmo, RHA1/pTip-QC2 empty vector control, or abiotic control was amended with THF and
THF removal was monitored. Error bars represent the range of duplicate reactions.

These  experiments  confirm  that  the  strain  CB1190  genes  Psed_6976-6979  encode  a 
multi-component  monooxygenase  that  is  active  with  both  dioxane  and  THF.  This  is  the  first 
functional expression of a monooxygenase from a dioxane-metabolizing microorganism and
from  any  Pseudonocardia  sp. 

Growth of P. dioxanivorans strain CB1190 on HEAA

Growth of strain CB1190 on identified intermediates of dioxane metabolism, such as ethylene glycol,
glycolate, glycolaldehyde, glyoxal, glyoxylic acid, oxalic acid, and formic acid, has been confirmed (91).
However, growth on 2-hydroxy-1,4-dioxane, 1,4-dioxane-2-one (dioxanone), HEAA, and
1,2-dihydroxyethoxy-2-hydroxyethoxy acetic acid has not been determined. We obtained HEAA in its salt
form in order to test the growth of strain CB1190 on HEAA as sole carbon and energy source. Strain
CB1190 grew with HEAA, as verified by visual inspection (data not shown) and HEAA removal (Fig.
6.5). Starting with a 1:500 inoculation of strain CB1190 cells previously growing on dioxane, complete
removal was accomplished within 11 days.

Figure 6.5. Degradation of HEAA by strain CB1190.

Cells of strain CB1190 were grown in AMS medium amended with HEAA (n = 5).

Testing  the  effect  of  acetylene  exposure  on  HEAA  degradation 

Mahendra et al. (91) proposed that, in the dioxane metabolic pathway in strain CB1190, a
monooxygenase is also involved in the hydroxylation of HEAA and that this monooxygenase is
likely the same enzyme that catalyzes the initial hydroxylation of dioxane. However, after
incubating the dioxane-degrading RHA1/pTip-thfmo with chemically synthesized HEAA, no
HEAA removal was observed in 72 h (data not shown).

Since brief exposure to acetylene gas causes irreversible inhibition in some
monooxygenases (e.g., 24, 123, 136, 147) and specifically inhibits co-metabolic and metabolic
degradation of dioxane (89, 91), dioxane-grown strain CB1190 cells were briefly exposed to
acetylene gas to test whether transformation of HEAA is indeed catalyzed by a monooxygenase
reaction. As Fig. 6.6 shows, brief exposure had minimal effects on HEAA removal, with 1.25
mM of HEAA removed within 90 h.

Figure 6.6. Effect of acetylene on HEAA removal by strain CB1190.

Dioxane-grown cells were exposed to acetylene and then HEAA removal was monitored. Squares, abiotic
samples;  Triangles,  acetylene-exposed  cells;  Circles,  no-acetylene-exposure  cells.  Error  bars  represent  the 
range  of  triplicate  samples. 

These results contradict the proposal by Mahendra et al. (91) that the same
monooxygenase acts on both dioxane and HEAA. If a monooxygenase does in fact catalyze the
transformation of HEAA in strain CB1190, this enzyme is different from that encoded by
Psed_6976-6979,  and  is  unlike  many  other  monooxygenases  in  that  it  is  resistant  to  irreversible 
inhibition  by  acetylene. 

Conclusion 

This study identified for the first time genes used by Pseudonocardia dioxanivorans
strain CB1190 to grow with THF. The monooxygenase involved in the initial hydroxylation of
both THF and dioxane was confirmed by cloning and heterologous expression in R. jostii strain
RHA1. Experimental evidence with these RHA1 clones and with the monooxygenase-inhibiting
gas acetylene contradicts previous suggestions that the same monooxygenase acts on both
dioxane and the dioxane metabolic intermediate HEAA; it appears that HEAA is
transformed via a still unknown mechanism. While substantial advances were achieved in
understanding the dioxane metabolic pathway both in Chapter 5 and in the current chapter,
further study is required to understand how dioxane C4 intermediates are transformed to C2
compounds.

7. Using metabolomics analyses to assist genome annotation of Pseudonocardia dioxanivorans strain CB1190


1.  Results  presented  in  this  section  are  included  in  publications  in  preparation. 

Introduction 

Strain  CB1190  and  genome  annotation 

Pseudonocardia dioxanivorans CB1190 was previously isolated from industrial sludge
contaminated with dioxane (112). It is one of the first bacteria reported to be capable of
growth on dioxane and the first member of the genus Pseudonocardia for which there is an
annotated genome sequence (132). In addition to dioxane, strain CB1190 is capable of using a
broad spectrum of organic compounds as its sole carbon and energy source, including other
ethers (e.g., tetrahydrofuran, diethyl ether, and butyl methyl ether), various alcohols, and aromatic
compounds (e.g., benzene and toluene) (90, 112). It can also grow autotrophically with CO2.
The ability to use a wide variety of carbon sources is not surprising given the large genome (7.3
Mb) (132). However, the presence of multiple apparent carbon utilization pathways for each
individual compound, multiple gene copies of each enzyme in one particular pathway, and often
missing key enzymes in pathways create ambiguities in the genome annotation. For example,
strain CB1190 seems to bear three CO2 fixation pathways (a Calvin-Benson-Bassham (CBB)
pathway, a reverse citric acid cycle, and a Wood-Ljungdahl pathway), none of which is complete.
Therefore, a genome-independent method is necessary to evaluate the strain CB1190 genome
annotation  in  order  to  more  precisely  connect  the  genomic  information  and  strain  CB1190’s 
physiological  traits.  In  this  work  we  chose  to  use  C2  metabolic  pathways  as  test  cases  for 
verifying  genome  annotation. 

Ethylmalonyl-CoA  pathway  and  glyoxylate  cycle  pathway 

The transformation of dioxane by strain CB1190 eventually leads to the production of
two glyoxylate molecules, which are assimilated via the glyoxylate carboligase pathway (see
Chapter 5). Coincidentally, glyoxylate is also an important intermediate in aerobic
bacterial assimilation of acetyl-CoA (mostly synthesized from C1 and C2 compounds, e.g.,
acetate), because more complex building blocks, such as C3 compounds (e.g., pyruvate), can only
be synthesized from acetyl-CoA. However, since pyruvate synthase (EC 1.2.7.1) is incapable of
synthesizing pyruvate directly from acetyl-CoA under aerobic conditions, alternative pathways
have to be used to complete this task in aerobic bacteria (2, 5). Only two aerobic acetyl-CoA
assimilation pathways are currently known: the glyoxylate cycle and the ethylmalonyl-CoA
pathway (used in acetate metabolism) (2, 5, 34-37, 105, 118, 179). Both pathways convert
acetyl-CoA to malate by condensation with glyoxylate, and from there pyruvate, oxaloacetate,
and α-ketoglutarate are made by established steps in carbon metabolism. Despite similar
substrates and products, these two pathways differ in two ways: the
glyoxylate cycle generates reducing equivalents as NADH while the ethylmalonyl-CoA pathway
consumes reducing equivalents as NADPH, and carbon fixation occurs in the ethylmalonyl-CoA
pathway but not in the glyoxylate cycle (Fig. 7.1) (2). A recent survey of all sequenced bacterial
genomes revealed that while more than 30% of these genomes encode the glyoxylate cycle,
only 7% of them encode the ethylmalonyl-CoA pathway, and 1% encode both pathways (2).
Strain CB1190’s genome seems to encode both pathways, but ambiguities exist in the genome
annotation. For example, the isocitrate lyase (EC 4.1.3.1) in the glyoxylate cycle pathway has
diverged from conventional isocitrate lyases at the amino acid sequence level, and the key enzymes
acetoacetyl-CoA reductase (EC 1.1.1.36), mesaconyl-CoA hydratase (EC 4.2.1.-), and
malate-CoA ligase (EC 6.2.1.9) in the ethylmalonyl-CoA pathway are missing (131).

Glycine cleavage system and glycine-glyoxylate pathway

As opposed to acetate metabolism through the ethylmalonyl-CoA pathway, glycine
assimilation in aerobic bacteria represents a non-acetyl-CoA-related C2 compound assimilation
strategy, which is generally achieved through the glycine cleavage system (GCS). Here, one
glycine molecule is cleaved into CO2 and a one-carbon unit carried as methylene-THF (tetrahydrofolate),
which in turn reacts with a second glycine molecule to synthesize a serine. From the serine,
pyruvate  and  other  central  pathway  metabolites  can  be  synthesized  and  used  to  produce  energy 
and  building  blocks. 

Additionally,  some  bacteria  can  generate  glyoxylate  from  glycine  through  the  direct 
oxidation of glycine by D-amino-acid oxidase (EC 1.4.3.19). The strain CB1190 genome
encodes  a  D-amino-acid  oxidase  homologue.  Since  glyoxylate  metabolism  through  the 
glyoxylate  carboligase  pathway  is  established  in  CB1190  (Chapter  5),  glycine  may  also  be 
metabolized  via  glyoxylate  as  an  important  intermediate. 

Research  questions,  hypotheses  and  important  results 

The  work  presented  here  had  two  motivations.  First,  the  ambiguities  in  the  genome 
annotation  must  be  clarified  so  that  the  genetic  information  can  be  more  effectively  leveraged. 
For  example,  an  unambiguous  genome  annotation  is  needed  to  construct  precise  genome-scale 
models  that  allow  us  to  predict  behaviors  of  strain  CB1190  in  dioxane  bioremediation  processes. 
Therefore,  verifying  the  annotation  of  genes  involved  in  the  assimilation  of  C2  compounds 
represents  a  test  case  for  resolving  genome  annotation  ambiguities.  Second,  from  a  practical 
standpoint,  given  that  the  metabolism  of  environmental  C2  compounds  could  interfere  with 
dioxane  metabolism  (by  virtue  of  their  sharing  the  glyoxylate  carboligase  pathway),  it  is 
important  to  understand  if  the  highlighted  C2  compound  assimilation  pathways  are  in  fact  active 
in  strain  CB1190.  To  these  ends,  in  this  study  we  analyzed  acetate  and  glycine  metabolic 
pathways  in  strain  CB1190  at  the  functional  and  genetic  levels. 

Materials  and  Methods 

Chemicals 

[1-13C]sodium pyruvate, [1-13C]sodium acetate, [2-13C]sodium acetate, and [13C]sodium
bicarbonate  (99%  purity)  were  purchased  from  Cambridge  Isotope  Laboratories,  Inc.  (Andover, 
MA). 

Bacterial  strain  and  culture  conditions 

Strain CB1190 was routinely cultivated in ammonium mineral salts (AMS) medium
(112). Replicate 250 mL glass bottles with screw-caps were filled with AMS medium and were
amended with the appropriate carbon source. Dioxane-grown strain CB1190 was used as a
1% inoculum to start culture lines on the respective carbon sources. To
unambiguously track carbon transformations and incorporation in the metabolic network,
tracer experiments were conducted in duplicate with the respective isotopic carbon sources
(10 - 30 mM): [1-13C]sodium pyruvate, [1-13C]sodium acetate, [2-13C]sodium acetate,
and glycine (50 mM) with [13C]sodium bicarbonate. To avoid carry-over of unlabeled carbon
from the inoculum, strain CB1190 was subcultured three times with a 1% inoculum in AMS medium
containing the isotopic carbon source before being harvested for isotopomer analysis.

Analytical  methods 

Analysis of amino acid isotopomers was performed as previously described (149, 184).

RNA  extraction  for  transcription  studies 

Cells  were  harvested  when  approximately  half  of  the  added  substrate  had  been  consumed. 
Cells  from  replicate  bottles  were  collected  by  filtration,  were  scraped  from  the  filters  with  a 
sterile scalpel, and were transferred to 2 mL screw-top microcentrifuge tubes containing 1 g of
100-µm-diameter zirconia beads (Biospec Products, Bartlesville, OK). These tubes were stored at
-80°C until use. Nucleic acids were extracted using a modified version of the phenol method
described  previously  (60).  DNA  was  removed  by  treatment  with  DNase  I,  and  RNA  was  finally 
purified  with  an  RNeasy  Kit  (Qiagen). 

Reverse transcription-PCR

cDNA  was  synthesized  from  RNA  using  the  TaqMan  reverse  transcription  kit  (Applied 
Biosystems, Foster City, CA) with random hexamers according to the manufacturer’s protocol.
Synthesized cDNA was used as template in each subsequent PCR reaction. All PCR reactions
were performed with gDNA as a positive control, milliQ-H2O as a no-template negative control, and
additional negative controls using cDNA synthesized from cells grown under conditions in which the
target gene transcript was expected to be absent. PCR products were separated on a 0.8% agarose
gel to evaluate the results.

Cell-free  extracts  and  enzyme  measurement 

Strain CB1190 cells grown on the respective carbon source were collected by filtration,
scraped, and stored in 2 mL microcentrifuge tubes at -80°C. Zirconia beads were added, and the
cells were resuspended in 100 mM Tris·HCl (pH 8.2) buffer. Cells were broken by bead-beating,
and then debris was pelleted by centrifugation. The clear supernatant was transferred and
represented the cell-free extract. This extract remained stable when stored at 4°C for a week. The protein
content was determined by using the Bradford method with BSA as the standard. Citrate
synthase  (84),  isocitrate  lyase  (95),  acetyl-CoA  C-acetyltransferase  (3,  33,  35),  and  malate 
synthase  (3,  33,  35)  activities  were  tested  according  to  published  methods. 

Results  and  Discussion 

13C  labeled  pyruvate  to  evaluate  annotation  of  central  pathways 

In order to verify active central metabolic pathways, strain CB1190 was grown using 13C-labeled
pyruvate, since pyruvate plays pivotal roles in central metabolism. Labeling patterns of heavy
carbon (13C) in each amino acid were verified and revealed that strain CB1190 has complete
and active central metabolic pathways, including glycolysis and the citrate cycle. The 13C-pyruvate
studies indicated the presence of an active glyoxylate cycle for quickly replenishing the citrate
cycle metabolites (such as malate), which are withdrawn for building block synthesis. This was
surprising, since the gene for isocitrate lyase, a key enzyme that cleaves isocitrate into glyoxylate
and succinate, appeared to be absent in the strain CB1190 genome. However, a recent
re-annotation of the genome data revealed the presence of a putative isocitrate lyase homologue
(Psed_4635), for which the amino acid sequence is distant from regular bacterial isocitrate lyases.
This  ambiguity,  along  with  other  reasons  mentioned  earlier  in  the  introduction,  motivated  us  to 
conduct  further  analyses  with  the  aid  of  stable  isotopic  analysis. 

Demonstration  of  acetate  assimilation  using  stable  isotopic  analyses 

[1-13C]  and  [2-13C]  sodium  acetate  were  used  to  probe  active  acetyl-CoA  assimilation 
pathways  in  strain  CB1190.  Several  representative  stable  isotopic  labeling  profiles  were  obtained 
with  these  substrates  (Table  7.1).  The  [M-159]  labeling  profiles  of  alanine,  serine  and  glycine 
indicated  that  pyruvate  was  not  directly  synthesized  from  acetyl-CoA.  The  reason  is  that  a  major 
portion  of  the  [M-159]  fragment  from  all  three  amino  acids  was  not  enriched  with  heavy  carbon, 
which would have been highly enriched if pyruvate had been directly synthesized from acetyl-CoA.
This conclusion agrees with the knowledge that pyruvate synthase (EC 1.2.7.1) requires a low
redox  potential  in  order  to  catalyze  the  carboxylation  reaction;  therefore  aerobic  bacteria  cannot 
normally  use  this  reaction  to  synthesize  pyruvate  from  acetyl-CoA.  This  result  also  indicated 
that  either  the  glyoxylate  or  the  ethylmalonyl-CoA  pathway  for  acetyl-CoA  assimilation  might 
be  functional  in  strain  CB1190. 


Table 7.1. Amino acid 13C stable isotopic labeling profiles used to identify acetate assimilation
pathways in strain CB1190.

        [1-13C] acetate grown cells                  [2-13C] acetate grown cells
        [M-57] molecule                              [M-57] molecule
Mass    Ala    Ser    Gly    Asp    Thr    Met       Ala    Ser    Gly
M+0     0.32   0.30   0.35   0.06   0.04   0.04      0.00   0.01   0.07
M+1     0.51   0.55   0.59   0.55   0.54   0.49      0.20   0.17   0.63
M+2     0.17   0.15   0.06   0.32   0.33   0.35      0.51   0.55   0.30
M+3     -      -      -      0.07   0.08   0.11      0.28   0.26   -
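
As a rough consistency check on Table 7.1, the Python sketch below takes the aspartate and threonine profiles from [1-13C]acetate-grown cells and tests two things: that each profile sums to approximately 1, and whether any isotopomer carries more heavy carbons than a glyoxylate-cycle-only route would allow (at most two, following the argument made in the text). It assumes that the M+3 entries belong to these amino acids, which is consistent with the "~10%" figure quoted in the text. The helper names, the rounding tolerance, and the noise floor are hypothetical choices made for illustration only.

```python
# Mass isotopomer fractions (M+0 .. M+3) transcribed from Table 7.1,
# [1-13C]acetate-grown cells. Assumption: the M+3 values belong to Asp and Thr.
mid_1_13c = {
    "Asp": [0.06, 0.55, 0.32, 0.07],
    "Thr": [0.04, 0.54, 0.33, 0.08],
}


def check_mid(fractions, tol=0.02):
    """Return True if the isotopomer fractions sum to ~1 (rounding tolerance)."""
    return abs(sum(fractions) - 1.0) <= tol


def exceeds_label_limit(fractions, max_labels, floor=0.02):
    """Return True if any isotopomer with more than max_labels heavy carbons
    is present above a (hypothetical) noise floor."""
    return any(f > floor for f in fractions[max_labels + 1:])


for amino_acid, mid in mid_1_13c.items():
    print(amino_acid, check_mid(mid), exceeds_label_limit(mid, max_labels=2))
```

Both profiles pass the sum check and both exceed the two-label limit, which is the observation the text uses to favor the ethylmalonyl-CoA pathway.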

In order to determine which acetyl-CoA assimilation pathway is used by strain CB1190,
we chose amino acids that are directly derived from the end products of the glyoxylate cycle or
ethylmalonyl-CoA pathway (malate and succinate, respectively) as indicators to interpret the
results. Isotopic labeling profiles of the [M-57] fragment for the amino acids aspartate, threonine,
and methionine are listed in Table 7.1. For these amino acids, ~10% of the molecules
contained three heavy 13C carbons. Considering that the glyoxylate cycle’s overall reaction can
be simplified as synthesizing one molecule of malate from two molecules of acetyl-CoA,
malate-derived amino acids such as aspartate and threonine can contain at most two heavy
carbons when either [1-13C] or [2-13C]sodium acetate is the sole carbon source. Given that
~10% of the aspartate and threonine from strain CB1190 contained
three heavy carbons, the ethylmalonyl-CoA pathway is more likely used to assimilate acetyl-CoA by
strain CB1190. However, from the labeling profiles alone, we could not rule out the
possibility that the glyoxylate cycle might also be active during growth, even though the
simultaneous use of divergent carbon assimilation pathways for a single substrate is rare in
bacteria. Therefore, we employed enzyme assays and reverse transcription-PCR (RT-PCR) in order
to provide additional evidence for determining which acetyl-CoA pathway(s) is used by strain
CB1190.

Figure 7.1. Schematic of the ethylmalonyl-CoA pathway in CB1190 with possible genes encoding
the enzyme assigned to each step. Genes missing from the genome are in red.

In enzyme assay experiments, we chose isocitrate lyase from the glyoxylate cycle and
acetyl-CoA C-acetyltransferase (EC 2.3.1.9) and L-malyl-CoA/beta-methylmalyl-CoA lyase (EC
4.1.3.24) from the ethylmalonyl-CoA pathway as representative enzymes to determine the
activity of these pathways during strain CB1190 growth on acetate. Citrate synthase activity was
used as a positive control for the activity of cell-free extracts. In isocitrate lyase assays, cell-free
extracts from E. coli cells grown on acetate as a sole carbon and energy source were used as
positive controls. In addition, extracts from glycine-grown strain CB1190 cells were used as
negative controls.

Isocitrate lyase activity was not detected in acetate-grown CB1190 cell-free extracts (data
not shown), which indicated that the glyoxylate cycle was probably not functional in strain
CB1190 under the experimental conditions tested.

For  the  acetyl-CoA  C-acetyltransferase  and  L-malyl-CoA/beta-methylmalyl-CoA  lyase 
assays, extracts from glycine-grown strain CB1190 cells were again used as negative controls.
Activity for both of these enzymes was detected in extracts from acetate-grown
CB1190. In addition to the stable isotopic analyses and enzyme assays, we employed RT-PCR to
determine whether pathway-specific genes were expressed during the growth of strain CB1190
on acetate. Total RNA from acetate-grown cells was converted to cDNA and used in PCR
reactions. Strain CB1190 genomic DNA was used as a positive control for the PCR reactions,
while RNA from glycine-grown cells was used as a negative control. All of the key genes
selected from the ethylmalonyl-CoA pathway were detected in cDNA derived from
acetate-grown cells, with the exception of the genes encoding 3-hydroxybutyryl-CoA dehydratase (EC
4.2.1.55) and malate-CoA ligase (EC 6.2.1.9).

The results demonstrate that the ethylmalonyl-CoA pathway, which was recently
identified in acetate-grown Rhodobacter sphaeroides, is used by strain CB1190 to incorporate
acetate. While glyoxylate is a metabolite in dioxane degradation, it is also one of the key
intermediates in the ethylmalonyl-CoA pathway, as well as in the glyoxylate cycle. The
glyoxylate cycle is commonly used by aerobic acetate-utilizing microorganisms, but it appears
not to be active in strain CB1190 during growth with acetate. The ethylmalonyl-CoA pathway uses
three molecules of acetyl-CoA (generated directly from acetate) and two molecules of carbon
dioxide to produce one molecule of malate and one molecule of succinate. Malate and succinate
are then incorporated into the central tricarboxylic acid cycle (TCA cycle) to produce anaplerotic
and gluconeogenic building blocks. The 13C-labeling profile indicated that the two carbon
dioxide molecules fixed along the ethylmalonyl-CoA pathway were predominantly from acetate
oxidation. It is not yet known if the two carbon dioxide molecules could alternatively come from
strain CB1190’s ambient environment with an elevated CO2 level as a carbon sequestration process.
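
The overall stoichiometry stated above (three acetyl-CoA plus two carbon dioxide giving one malate plus one succinate) can be checked with simple carbon bookkeeping. The Python sketch below is only an illustrative balance: it counts the two acetyl carbons of each acetyl-CoA, ignores the CoA carrier, cofactors, and reducing equivalents, and uses hypothetical names throughout.

```python
# Carbon atoms per molecule (acetyl-CoA is counted by its two acetyl carbons only).
CARBONS = {"acetyl": 2, "CO2": 1, "malate": 4, "succinate": 4}


def carbon_total(side):
    """Sum carbon atoms over a {species: molecule count} mapping."""
    return sum(CARBONS[species] * count for species, count in side.items())


substrates = {"acetyl": 3, "CO2": 2}
products = {"malate": 1, "succinate": 1}

c_in = carbon_total(substrates)
c_out = carbon_total(products)
print(c_in, c_out)  # 8 8

# Fraction of product carbon that originates from fixed CO2.
fixed_fraction = substrates["CO2"] * CARBONS["CO2"] / c_out
print(fixed_fraction)  # 0.25
```

Under these simplifications the carbon balances, and one quarter of the carbon in the two C4 products derives from fixed carbon dioxide, which is why the origin of that carbon dioxide matters for interpreting the labeling profiles.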

Demonstration  of  glycine  assimilation  in  strain  CB1190  using  stable  isotopic  analyses 

As mentioned earlier, glycine assimilation in CB1190 can also involve glyoxylate as an
important intermediate, via the activity of D-amino acid oxidase. Alternatively, the glycine
cleavage system (GCS) might be used to assimilate glycine. Unfortunately, with respect to the
building blocks with higher carbon numbers (such as alanine or serine), both pathways give
identical stable isotopic labeling patterns, regardless of what type of labeled glycine is used.
This is because both pathways employ similar carbon condensation strategies to make the key C3
metabolic intermediate pyruvate from two C2 compounds (glycine in this case). Therefore, in
order to differentiate these pathways, a novel 13C stable isotopic analysis method was necessary
(Fig. 7.2). Since enzymes involved in the GCS are highly reversible (they are also involved in the
synthesis and utilization of C1 compounds), we amended AMS medium with 13C-labeled sodium
bicarbonate along with the glycine as carbon source. If the 13C-labeled bicarbonate is
integrated into glycine and other amino acids, the GCS pathway is active under the
culture conditions provided.

Tables 7.2A and 7.2B show that the first carbons of glycine, serine, and alanine were enriched
with heavy carbon, which could only come from the 13C-labeled bicarbonate. In addition,
labeling profiles of more complex amino acids such as aspartate, threonine, and methionine also
supported the conclusion that a GCS was functional in strain CB1190. Genes potentially
involved in the GCS, as identified by bioinformatic analysis, are shown in Fig. 7.3.

It should be noted that since carbon reuse is a common feature in aerobic bacteria, only
about 15% of alanine and serine was enriched with heavy carbon from exogenous bicarbonate.
These data are not sufficient to conclude that the GCS is the sole pathway used by strain CB1190
during glycine assimilation.

Figure 7.2. 13C stable isotopic analysis method used to differentiate glycine assimilation pathways.


Table 7.2A. Amino acid 13C stable isotopic labeling profiles used to identify glycine assimilation
pathways in strain CB1190.

Table 7.2B. Amino acid 13C stable isotopic labeling profiles used to identify glycine assimilation
pathways in strain CB1190.

Figure 7.3. Schematic of the glycine cleavage system pathway in strain CB1190 with possible genes
encoding the enzyme assigned to each step.

Conclusion 

This  study  attempted  to  confirm  and  capitalize  on  the  recent  sequencing  and  annotation 
of the Pseudonocardia dioxanivorans strain CB1190 genome. We focused on the metabolism of
specific C2 compounds, with the compound glyoxylate being an intermediate that unifies the
metabolic pathways studied here and the dioxane metabolic pathway examined in Chapter 5. We
demonstrated for the first time that the ethylmalonyl-CoA pathway is functional when strain
CB1190 is provided sodium acetate as the sole carbon and energy source, even though several
key enzyme-encoding genes are missing from the genome annotation. In addition, our results
indicate that the glycine cleavage system is likely used by strain CB1190 to degrade and
assimilate glycine. However, it is unclear if the glycine-glyoxylate conversion system is also
functional when glycine is provided as the sole carbon and energy source. For both pathways the
detection  of  metabolic  activities  contributes  to  the  confirmation  of  the  annotation  of  genes 
putatively  involved  in  these  pathways. 

This  work,  along  with  the  application  of  the  technique  in  Chapter  5,  demonstrates  the 
utility of in vivo 13C-labeled metabolite analysis for studying fundamental metabolic pathways.
This technique provides complementary and direct information for genomic and transcriptomic
analysis  of  cellular  metabolism,  and  would  be  highly  applicable  for  confirming  the  interesting 
organic  substrate  degradation  pathways  described  in  Chapter  4. 

8. Towards development of a genetic system for Pseudonocardia dioxanivorans strain CB1190


Introduction 

The  ability  to  manipulate  DNA  in  a  microorganism  is  desirable  for  two  main  reasons. 
First,  interrupting  genes  is  traditionally  the  pre-eminent  method  for  linking  genetic  and 
phenotypic  or  metabolic  traits.  Second,  removing,  adding,  or  customizing  a  microorganism’s 
genetic complement permits manipulation of metabolism in order to achieve a desired output.
Therefore,  the  ability  to  manipulate  a  microorganism’s  DNA  has  impact  on  both  understanding 
and  engineering  biology. 

The  capacity  to  use  dioxane  as  a  sole  source  of  carbon  and  energy  is  relatively  rare 
among  tested  bacterial  isolates  (for  example,  see  Chapter  3).  Pseudonocardia  dioxanivorans 
strain  CB1190  does  possess  this  capacity,  and  it  is  the  first  microorganism  that  metabolizes 
dioxane  for  which  the  genome  has  been  sequenced  and  annotated  (see  Chapter  4).  Attempts  to 
link  this  capacity  to  specific  genes  in  order  to  better  understand  dioxane  metabolism  were  made 
in  Chapters  4,  5  and  6,  by  using  bioinformatic  analyses,  gene  expression  assays  (microarrays  and 
qRT-PCR),  and  heterologous  expression  of  strain  CB1190  genes.  However,  none  of  these 
methods  provided  conclusive  proof  of  the  involvement  and  strict  requirement  of  candidate  genes 
in dioxane metabolism. We sought the ability to knock out a candidate gene, observe loss of the
ability to metabolize dioxane, and then observe the restoration of dioxane metabolism through
the addition of the candidate gene in trans. For this reason we endeavored to
develop a genetic system for strain CB1190.

Materials  and  Methods 

Antibiotic  resistance  testing  of  CB1190 

CB1190  was  grown  on  basal  salts  medium  (BSM)  with  tetrahydrofuran  (THF)  as 
previously described (112). These cultures were used to inoculate tubes containing medium and the
appropriate  antibiotic.  Tubes  were  incubated  at  30  °C  with  shaking  and  cell  growth  was  visually 
monitored  and  recorded  daily. 

Conjugative  matings  with  E.  coli  to  introduce  plasmids  into  CB1190 

Conjugative  matings  with  E.  coli  to  introduce  plasmids  into  CB1190  were  carried  out  on 
R2A  plates  at  30°C  for  48  h.  Matings  using  a  variety  of  ratios  of  donor  and  recipient  cells  were 
tested.  Mating  mixtures  were  suspended  in  BSM  and  dilutions  were  plated  on  BSM  plates 
containing  appropriate  antibiotics.  THF  was  provided  as  the  carbon  source  in  the  vapor  phase. 

Transposon mutagenesis of CB1190

In  vitro  transposition  of  the  EZ-Tn5  <kan2>  transposon  into  the  CB1190  chromosome 
was  carried  out  using  the  EZ-Tn5  <kan2>  transpososome  system  (Epicentre  Biotechnologies) 
following the manufacturer’s directions. Electroporation conditions were also as described in the
manufacturer’s  instructions. 

Preparation of electrocompetent cells

CB1190  actively  growing  in  BSM  containing  THF  was  used  to  inoculate  a  fresh  culture 
in  the  same  medium  in  a  125  ml  growth  flask.  Cells  were  grown  at  30  °C  with  shaking.  THF 
was  added  every  other  day.  After  5-8  days,  cells  were  harvested  by  centrifugation,  washed 
three  times  with  sterile  deionized  water  and  resuspended  in  a  final  volume  of  10  ml  of  deionized 
water. Cells were stored at -80 °C.

Electroporation  of  CB1190 

Electrocompetent CB1190 cells were thawed on ice. After thawing, cells were mixed
well using a vortex mixer and the remaining clumps of cells were allowed to settle for 2 minutes.
Avoiding cell clumps, 50 µl of the cell suspension was transferred to a 1.5 ml microfuge tube.
Plasmid DNA (1-2 µl) was added and the suspension was mixed and incubated on ice for 20
minutes. The cell/DNA suspension was transferred to a chilled electroporation cuvette and
placed on ice. The cells were then electroshocked, 1 ml of ice-cold R2A was immediately
added to the cuvette, and the cuvette was placed on ice. The cell suspensions were transferred to a test tube and
incubated at 30 °C for 12 h without shaking. Cells were then plated onto R2A plates containing
25 µg/ml kanamycin and incubated at 30 °C. Colonies were visible within 5-10 days.

Results  and  Discussion 

We obtained seven different plasmid vectors that function in various Gram-positive
bacteria: pTRW10 (erythromycin resistance [Er]), pTRKH2 (Er), pTRKL2 (Er), pSA3 (Er,
tetracycline resistance [Tc], chloramphenicol resistance), pTB21 (hygromycin resistance [Hg]),
pMyVec1 (kanamycin resistance) and pMyVec2 (Hg). CB1190 was found to be sensitive to 5
µg/ml hygromycin, kanamycin, chloramphenicol, and erythromycin, and 10
µg/ml tetracycline. No growth on any of the selective plates was observed for over one
week, while growth on media lacking antibiotics was observed within 24 h.

E. coli strain S17-1 was transformed with each of the seven plasmids. Conjugative
matings between S17-1 harboring each of the plasmids and CB1190 were attempted in order to
introduce the plasmids into CB1190; however, no antibiotic-resistant transformants of CB1190
were obtained despite many attempts.

We  tested  the  Transpososome  system  from  Epicentre  Biotechnologies  for  in  vitro 
transposition  of  the  EZ-Tn5  <kan2>  transposon  into  the  CB1190  chromosome.  Successful 
introduction  of  the  transposon  could  be  used  to  generate  random  mutant  libraries  with  marked 
mutations. Mutant libraries could then be screened for phenotypes of interest. We were able to
verify the stable introduction of the EZ-Tn5 <kan2> transposon into the CB1190 genome;
however, we only obtained 5-30 kanamycin-resistant CB1190 colonies from each experiment.
Therefore, further optimization will be required for this to be a useful technique. However, the
use  of  the  optimized  electrocompetent  cells  described  below  is  expected  to  result  in  a  significant 
improvement  in  efficiency. 

Various  electroporation  conditions  were  tested  with  the  seven  different  plasmids  listed 
above but no antibiotic-resistant transformants of CB1190 were obtained. These results
suggested that either the plasmid DNA did not enter the CB1190 cells (i.e., the cells were not
electrocompetent), the plasmids were not stably maintained in CB1190, or the genes required for
antibiotic resistance were not being expressed. Since we had been able to show that the
kanamycin resistance gene from the EZ-Tn5 <kan2> transposon could be successfully expressed
in CB1190, we decided to concentrate our efforts on pMyVec1, which carries a kanamycin
resistance gene. After many attempts to optimize conditions for electroporation we were finally
able to introduce pMyVec1 into CB1190, as indicated by acquisition of kanamycin resistance
(Table 8.1). The important change in the preparation of electrocompetent CB1190 cells was to
resuspend cells in distilled water rather than in solutions containing glycerol, which is the
commonly reported method. Whole-cell PCR using primers to amplify the kanamycin resistance
gene, as well as purification of pMyVec1 from the kanamycin-resistant CB1190 recombinants,
demonstrated that pMyVec1 had been stably introduced into strain CB1190. These results also
indicate that any nucleic acid restriction systems that may be present in strain CB1190 are not
fully effective at eliminating non-native DNA.


Table 8.1. Results of CB1190 electroporation with pMyVec1.

Results are averages from three independent experiments. pMyVec1 concentration was 50 ng/µl.

Cuvette gap width (cm)   kV    µF   Ω     Time constant (ms)   CFU/ng DNA
0.1                      2.5   25   200   4.9                  0.62
0.1                      2.5   25   400   9.5                  1.80
0.1                      2.5   25   800   15.7                 0.95
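
The electroporation settings in Table 8.1 can be checked against the nominal RC time constant of the pulse circuit. The short Python sketch below does this under two assumptions that the table does not state explicitly: that the 200, 400 and 800 values are resistance settings in ohms, and that the millisecond column is the measured time constant. The variable and function names are illustrative only.

```python
# Values transcribed from Table 8.1; capacitance was 25 uF in every run.
CAPACITANCE_F = 25e-6

# (resistance in ohms [assumed], measured time constant in ms [assumed], CFU/ng DNA)
TABLE_8_1 = [
    (200, 4.9, 0.62),
    (400, 9.5, 1.80),
    (800, 15.7, 0.95),
]


def nominal_time_constant_ms(resistance_ohm, capacitance_f=CAPACITANCE_F):
    """Nominal RC time constant (tau = R * C), converted to milliseconds."""
    return resistance_ohm * capacitance_f * 1e3


def summarize(rows):
    """Pair each measured time constant with its nominal RC value."""
    summary = []
    for resistance, measured_ms, cfu_per_ng in rows:
        nominal_ms = nominal_time_constant_ms(resistance)
        summary.append({
            "ohm": resistance,
            "nominal_ms": nominal_ms,
            "measured_ms": measured_ms,
            "measured_over_nominal": measured_ms / nominal_ms,
            "cfu_per_ng": cfu_per_ng,
        })
    return summary


# Setting with the highest transformation efficiency in the table.
best_setting = max(TABLE_8_1, key=lambda row: row[2])

for entry in summarize(TABLE_8_1):
    print(entry)
print("highest CFU/ng at", best_setting[0], "ohm")
```

Under these assumptions, the measured values track the nominal RC product closely at the lowest setting and fall progressively below it at higher settings, and the highest efficiency in the table is at the intermediate setting.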

Conclusion 

This  is  the  first  report  of  the  successful  introduction  of  foreign  DNA  into  P. 
dioxanivorans strain CB1190. The uptake of transpososomes enables random
mutagenesis in order to discover previously unknown genes required for specific cell functions.
The uptake of plasmids will make possible the functional characterization of genes in CB1190
using both directed mutagenesis and heterologous gene expression. Future needs include
developing techniques for plasmid-based directed mutagenesis in order to specifically knock out
genes, and determining whether introduced plasmids can act as platforms for gene expression in strain
CB1190 for complementing knocked-out genes.

9. Implications of research for NDMA and dioxane bioremediation


The  work  presented  here  confirms  our  hypothesis  that  oxygenases,  specifically 
monooxygenases,  are  important  in  the  biodegradation  of  both  NDMA  and  dioxane.  NDMA  has 
not  yet  been  shown  to  support  bacterial  growth,  and  only  a  few  microorganisms  have  been  found 
to  metabolize  dioxane.  Therefore,  the  growth  of  contaminant  co-metabolizing  microorganisms, 
and  perhaps  the  induction  of  contaminant-transforming  monooxygenases,  will  be  dependent  on 
the  addition  of  a  primary  substrate.  For  NDMA  degradation,  propane  may  be  an  ideal  primary 
substrate due to its low cost, lack of toxicity and demonstrated induction of NDMA-active
monooxygenases. NDMA concentrations can be reduced to ng/L levels by co-metabolic
biodegradation.

For  dioxane  degradation,  in  the  absence  of  dioxane-metabolizing  microorganisms  such  as 
Pseudonocardia dioxanivorans strain CB1190, a variety of primary substrates may lead to co-metabolic
dioxane degradation, including methane, toluene and THF. While some of these
compounds  may  be  toxic  and  therefore  undesirable  to  release  into  the  environment,  they  may 
already  be  present  at  some  dioxane-polluted  sites  as  co-contaminants. 

The  potential  for  the  inhibition  of  NDMA  and/or  dioxane  biodegradation  by  other 
compounds  may  be  a  concern  at  contaminated  sites.  NDMA  transformation,  which  is  induced  by 
the  addition  of  propane  in  tested  bacteria,  is  inhibited  by  high  propane  concentrations  in  a 
competitive,  reversible  manner.  Therefore,  an  appropriate  propane  concentration  must  be  added 
to  induce  NDMA-degrading  microorganisms  and  enzymes  but  not  completely  block  NDMA 
transformation. A benefit of co-metabolic biodegradation, however, is that after the primary
substrate has been removed by metabolism, the microorganism often retains transformation capacity for
the secondary substrate, so NDMA may be transformed even once the added propane is
consumed. 
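
The trade-off described above, in which propane is needed for induction but competes with NDMA at the enzyme, can be illustrated with the standard Michaelis-Menten expression for competitive inhibition. The Python sketch below is a minimal illustration only: the kinetic parameters are hypothetical placeholders and were not measured in this work.

```python
def competitive_rate(s, i, vmax, km, ki):
    """Michaelis-Menten rate with a competitive, reversible inhibitor.

    v = Vmax * S / (Km * (1 + I / Ki) + S)
    Here S would be the NDMA concentration and I the propane concentration.
    """
    if km <= 0 or ki <= 0:
        raise ValueError("km and ki must be positive")
    return vmax * s / (km * (1.0 + i / ki) + s)


# Hypothetical parameters in arbitrary, self-consistent units.
VMAX, KM, KI = 10.0, 1.0, 1.0
ndma = 1.0

for propane in (0.0, 1.0, 10.0):
    rate = competitive_rate(ndma, propane, VMAX, KM, KI)
    print(f"propane={propane:5.1f}  NDMA transformation rate={rate:.3f}")
```

In this form the inhibitor raises the apparent half-saturation constant without changing the maximum rate, so the rate recovers as propane is consumed, consistent with the reversible inhibition described in the text.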

The  presence  of  TCA  and  DCE  at  a  site,  which  are  common  co-contaminants  with 
dioxane,  is  a  concern  both  because  of  the  human  toxicity  of  these  compounds  and  their  inhibitory 
effects on dioxane biodegradation. However, our results indicate that the ability to co-metabolically
degrade dioxane may be correlated with co-metabolic TCA and DCE transformation. Therefore, the
induction of co-metabolic dioxane-degrading microorganisms with an appropriate primary
substrate  may  in  fact  lead  to  the  removal  of  all  three  co-contaminants. 

Since dioxane can in fact support growth and proliferation of dioxane-degrading
microorganisms, an alternative primary substrate may not be necessary to achieve dioxane
removal at a site. However, work presented here with strain CB1190 indicates that the co-contaminants
TCA and (especially) DCE are strong inhibitors of dioxane degradation and that
this  bacterium  is  unable  to  transform  either  chlorinated  solvent.  Therefore,  at  dioxane  sites  with 
TCA  or  DCE  present,  the  preferred  strategy  could  be  to  stimulate  co-metabolic  dioxane 
degradation,  which  could  be  accompanied  by  fortuitous  TCA  and  DCE  removal. 

The  genome  sequence  of  strain  CB1190  indicates  that  this  bacterium  has  a  wide 
metabolic  substrate  range,  which  could  be  beneficial  for  the  growth  and  proliferation  of  this 
dioxane-metabolizing  bacterium  at  sites  contaminated  with  low  dioxane  concentrations  that 
poorly  support  growth.  It  is  not  yet  known  what  concentration  of  dioxane  triggers  induction  of 
enzymes necessary for dioxane transformation, but the metabolic diversity of this microorganism
may  make  it  an  ideal  candidate  for  the  bioremediation  of  dioxane  and  hydrocarbon-contaminated 
groundwater. 